(54) Title: CHITOBIASE AS A REPORTER ENZYME

(57) Abstract: The present invention relates to reporter gene constructs encoding a cytoplasmic form of chitobiase
(N,N'-diacetylchitobiase) and methods of using these reporter gene constructs. The use of a cytoplasmic form of
chitobiase as a reporter enzyme is generally applicable to the study of gene expression in organisms which do not
contain N-acetyl-β-D-glucosaminidases.
CHITOBIASE AS A REPORTER ENZYME



GOVERNMENT INTEREST IN THE INVENTION 
Work performed in developing the present invention was supported, in part, by National Science Foundation
grant MCB-9507209, NIH MBRS grant GM45765, and NIH NIGMS MARC F31 GM14967-04S1. Accordingly, the
United States Government may have certain rights in the invention.

Background of the Invention 

Field of the Invention 

Reporter enzymes are enzymes whose activities are easily assayed when present inside cells. In order to
study the regulation of a gene whose expression is regulated by various environmental and/or cellular factors or
influences, a gene encoding a reporter enzyme may be fused to the coding region or to the regulatory region of the
regulated gene.

One of the most popular cytoplasmic reporter enzymes for use in bacteria is β-galactosidase. It is widely
used in the art; however, because bacteria such as Escherichia coli contain an endogenous β-galactosidase-encoding
gene and β-galactosidase may be present in the cytoplasm of such bacteria, deletions of the lacZ gene, the source of
the enzyme, must be introduced into the host cell line prior to its use. One goal of the present invention is to provide
an alternative intracellular enzyme for use as a reporter.

This invention relates to genetic constructs and methods of using a cytoplasmic form of the chitobiase
enzyme as a reporter. The invention also comprises expression vectors which express the cytoplasmic form of
chitobiase. As used herein, all instances of the terminology "chitobiase" refer to a form of chitobiase which is present
in the cytoplasm of the cell. Cytoplasmic forms of chitobiase may be generated via genetic engineering or microbial
selection techniques.

Summary of the Invention 

One embodiment of the present invention is a method for characterizing a promoter comprising providing a
construct comprising the promoter operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase,
introducing the construct into host cells, and identifying sequences in the promoter which regulate transcription levels.
In one aspect of this embodiment the cytoplasmic form of chitobiase lacks a signal sequence. In another aspect of
this embodiment, the nucleic acid encoding a cytoplasmic form of chitobiase encodes a fusion protein which comprises
a cytoplasmic form of chitobiase fused to a heterologous polypeptide. In still another aspect of this embodiment, the
nucleic acid encoding a cytoplasmic form encodes a cytoplasmic form of chitobiase obtained from an organism
selected from the group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos
taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo
sapiens, Korat cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum, Porphyromonas
gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces plicatus, Streptomyces
thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus. In a further aspect of this embodiment, the method of identifying sequences which are involved in
directing transcription comprises mutagenizing the promoter. In another aspect of this embodiment the method of
identifying sequences which are involved in transcription comprises constructing deletions in the promoter.

Another embodiment of the present invention is a method for identifying a regulatory element capable of
directing or regulating transcription within a test nucleic acid sequence comprising providing a construct comprising
the test nucleic acid sequence operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase, introducing
the construct into host cells, and determining the level of chitobiase activity. In one aspect of this embodiment, the
cytoplasmic form of chitobiase lacks a signal sequence. In another aspect of this embodiment, the nucleic acid
encoding a cytoplasmic form of chitobiase encodes a fusion protein, the fusion protein comprising a cytoplasmic form
of chitobiase fused to a heterologous polypeptide. In still another aspect of this embodiment, the nucleic acid
encoding a cytoplasmic form encodes a cytoplasmic form of chitobiase obtained from an organism selected from the
group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos taurus,
Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo
sapiens, Korat cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum, Porphyromonas
gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces plicatus, Streptomyces
thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus. In one aspect of this embodiment, the reporter gene construct is introduced transiently. In another
aspect of this embodiment the reporter gene construct is introduced stably. The host cells may be selected from the
group consisting of prokaryotic cells and eukaryotic cells. In another aspect of this embodiment the method further
comprises permeabilizing or lysing the host cells. The permeabilizing or lysing step may comprise treating the host
cells with toluene. The step of determining the level of chitobiase activity may be selected from the group consisting
of measuring the amount of a chemiluminescent product produced from a substrate, measuring the amount of a
fluorescent product produced from a substrate, measuring the amount of light absorbed by a product produced from a
substrate, and measuring a decrease in the amount of a detectable substrate. In another embodiment the step of
determining the level of chitobiase activity may comprise determining the level of p-nitrophenol released from a
substrate. In another aspect of this embodiment the test nucleic acid sequence comprises a portion of genomic DNA.
In a further aspect of this embodiment the step of determining the level of chitobiase activity comprises determining
the level of chitobiase activity after exposing the host cells to a desired set of environmental conditions. In still
another aspect of this embodiment the step of determining the level of chitobiase activity comprises determining the
level of chitobiase activity after contacting the host cells with a compound to be tested for its influence on the level of
transcription from said regulatory element.

Another embodiment of the present invention is a method of detecting successful transformation, comprising
the steps of introducing a nucleic acid encoding a cytoplasmic form of chitobiase into host cells, and detecting
chitobiase expression in the host cells.

Another embodiment of the present invention is a fusion protein-reporter gene construct comprising a
promoter operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase fused in frame with a nucleic acid
encoding a heterologous polypeptide, wherein the heterologous polypeptide is not β-galactosidase or a portion thereof,
and wherein the heterologous polypeptide does not contain a signal peptide. In one aspect of this embodiment the
nucleic acid encodes a cytoplasmic form of chitobiase obtained from an organism selected from the group consisting
of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans,
Candida albicans, Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat cats,
Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum, Porphyromonas gingivalis,
Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces plicatus, Streptomyces
thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus. In another aspect of this embodiment the nucleic acid further comprises a λ site-specific
recombination sequence.

Another embodiment of the present invention is a reporter gene construct comprising plasmid pJMF3.

Another embodiment of the present invention is a reporter gene construct comprising plasmid pJMF4.

Another embodiment of the present invention is a reporter gene construct comprising plasmid pDYK9.

Another embodiment of the present invention is a reporter gene construct comprising plasmid pDYK11.

Another embodiment of the present invention is a host cell comprising a fusion protein-reporter gene
construct comprising a promoter operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase fused in
frame with a nucleic acid encoding a heterologous polypeptide, wherein the heterologous polypeptide is not
β-galactosidase or a portion thereof, and wherein the heterologous polypeptide does not contain a signal peptide. In one
aspect of this embodiment, the nucleic acid is integrated into a chromosome of the cell. In another aspect of this
embodiment, the nucleic acid is transiently expressed in the host cell.

Another embodiment of the present invention is a nucleic acid encoding a cytoplasmic form of chitobiase in
which the signal sequence of native chitobiase has been inactivated or deleted. In one aspect of this embodiment the
signal sequence has been mutated to inactivate it.

Another embodiment of the present invention is an isolated or purified polypeptide comprising a cytoplasmic
form of chitobiase fused in frame with a heterologous polypeptide, wherein the heterologous polypeptide is not
β-galactosidase or a portion thereof and wherein the heterologous polypeptide does not contain a signal peptide.

Another embodiment of the present invention is an isolated or purified polypeptide comprising a cytoplasmic
form of chitobiase in which the signal peptide of native chitobiase has been inactivated or deleted. In one aspect of
this embodiment the signal sequence has been mutated to inactivate it.

WO 01/27322 PCT/US00/21048

Another embodiment of the present invention is a method for monitoring the activity of a promoter
comprising providing a construct comprising the promoter operably linked to a nucleic acid encoding a cytoplasmic
form of chitobiase, introducing the construct into host cells, and determining the level of chitobiase activity. In one
aspect of this embodiment the cytoplasmic form of chitobiase lacks a signal sequence. In another aspect of this
embodiment the nucleic acid encoding a cytoplasmic form of chitobiase encodes a fusion protein, the fusion protein
comprising a cytoplasmic form of chitobiase fused to a heterologous polypeptide. In one aspect of this embodiment,
the nucleic acid encoding a cytoplasmic form encodes a cytoplasmic form of chitobiase obtained from an organism
selected from the group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos
taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo
sapiens, Korat cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum, Porphyromonas
gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces plicatus, Streptomyces
thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus. In another aspect of this embodiment, the reporter gene construct is introduced transiently. In still
another aspect of this embodiment the reporter gene construct is introduced stably. In a further aspect of this
embodiment the host cells are selected from the group consisting of prokaryotic cells and eukaryotic cells. In yet
another aspect of this embodiment the method further comprises permeabilizing or lysing the host cells. For example,
the permeabilizing or lysing step may comprise treating the host cells with toluene. In yet another aspect of this
embodiment, the step of determining the level of chitobiase activity may be selected from the group consisting of
measuring the amount of a chemiluminescent product produced from a substrate, measuring the amount of a
fluorescent product produced from a substrate, measuring the amount of light absorbed by a product produced from a
substrate, and measuring a decrease in the amount of a detectable substrate. In still another aspect of this
embodiment, the step of determining the level of chitobiase activity comprises determining the level of p-nitrophenol
released from a substrate. In still another aspect of this embodiment, the step of determining the level of chitobiase
activity comprises determining the level of chitobiase activity after exposing the host cells to a desired set of
environmental conditions. In a further aspect of this embodiment the step of determining the level of chitobiase
activity comprises determining the level of chitobiase activity after contacting the host cells with a compound to be
tested for its influence on the level of transcription from said regulatory element. For example, the compound may
comprise a compound to be tested for activity as a drug.

Brief Description of the Drawings

Figure 1 illustrates plasmids pJMF3 and pJMF4 containing attP in 2 different orientations and the lac
promoter with the first 21 amino acids of lacZα (from pUC19) fused in-frame to the chb gene. The sequence of the
fusion region is shown in Figure 2. Restriction sites shown are found once in the plasmid sequences except for NotI,
which has 2 sites flanking the P15A origin.

Figure 2 illustrates the sequence of the lac promoter and the chitobiase fusion found in pJMF3 and pJMF4
(SEQ ID NOS: 15 and 16). Fusion between lacZα (from pUC19) and chb [Soto-Gil, R.W. et al., J. Biol. Chem.
264:14778-14782 (1989)] is indicated by (/); start of transcription is indicated by (+1). Sequence and binding sites in
the lac promoter regulatory region are found in Dickson, R.C. et al., Science 187:27-32 (1975). Restriction enzyme
sites shown in Figure 2 are found once in the plasmid sequences except for SphI, which has 2 sites; these different
restriction sites can be used to replace the lac promoter with another promoter together with part of a coding region
to produce an in-frame fusion with chb.

Figure 3 depicts integration of chb fusions into the chromosome by site-specific recombination between attB
and attP. The steps involved are described below and in Diederich, L., et al., Plasmid 28:14-24 (1992).

Figure 4 illustrates plasmids pDYK9 and pDYK11, which contain dnaA-chb fusions. pDYK9 is deleted for the
rpmH regulatory region. The orientation of attP in pDYK9 and pDYK11 is the same as that of pJMF3. After
integration at attB of the larger NotI fragment, the dnaA promoters are oriented to transcribe in the same direction as
replication fork movement.

Figure 5 illustrates the rpmH-dnaA regulatory region and dnaA-chitobiase fusion (SEQ ID NO.: 17). The dnaA
box and promoters are shaded, and the coding region of the dnaA gene is striped. The fusion contains two amino
acids (between the backslashes) from pUC19. The region cloned into pDYK9 is between SphI primer II and KpnI
primer, and the region cloned into pDYK11 is between SphI primer I and KpnI primer. The numbers above the primers
refer to the nucleotide in the sequence amplified. For sequence numbering and the locations of promoters and protein
binding sites, see Froelich, J.M. et al., J. Bacteriol. 178:6006-6012 (1996); Hansen, F.G. et al., EMBO J. 1:1043-
1048 (1982) and Messer, W. and C.W. Weigel, Initiation of Chromosome Replication, p. 1579-1601 (1996) In F.C.
Neidhardt, R. Curtiss III, J.L. Ingraham, E.C.C. Lin, K.B. Low, B. Magasanik, W.S. Reznikoff, M. Riley, M. Schaechter
and H.E. Umbarger (Eds.), Escherichia coli and Salmonella: Cellular and Molecular Biology, ASM Press,
Washington, D.C.

Detailed Description of the Preferred Embodiment 

Reporter genes and reporter gene constructs play a number of important roles in a variety of molecular
biology techniques. For example, reporter genes may be used to determine whether a sequence contains a promoter or
other cis-acting element which directs transcription, such as an enhancer. In addition, reporter genes may be used to
identify regulatory sites in promoters or other cis-acting elements and to determine the effects of mutating these
regulatory sites on the level of gene expression directed by the promoters or other cis-acting elements. Reporter
genes may also be used to detect successful transformation. In addition, reporter genes may be used to monitor gene
expression under various conditions and to identify drugs.
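
As an illustration of how reporter activity can point to regulatory sites, the following Python sketch compares
chitobiase activity across a nested series of promoter deletions and reports the intervals whose removal changes
activity sharply. The function name, the data layout, the two-fold threshold, and the example values are all
hypothetical and are not taken from the invention as described; this is a sketch under those assumptions only.

```python
def map_regulatory_intervals(constructs, fold_threshold=2.0):
    """Flag promoter intervals whose deletion changes reporter activity.

    constructs: list of (upstream_endpoint_bp, activity) pairs, one per
    deletion construct, where the endpoint is the amount of upstream
    sequence retained (hypothetical layout).
    Returns (longer_endpoint, shorter_endpoint, fold_change) tuples for
    each adjacent pair whose activity differs by at least fold_threshold.
    """
    # Order from the longest promoter fragment to the shortest.
    ordered = sorted(constructs, key=lambda c: c[0], reverse=True)
    hits = []
    for (long_end, long_act), (short_end, short_act) in zip(ordered, ordered[1:]):
        if long_act <= 0 or short_act <= 0:
            continue  # a fold change is undefined without positive activities
        fold = short_act / long_act
        if fold >= fold_threshold or fold <= 1.0 / fold_threshold:
            hits.append((long_end, short_end, fold))
    return hits


# Made-up activities for four deletion constructs (arbitrary units).
example = [(500, 100.0), (300, 95.0), (150, 20.0), (50, 18.0)]
for long_end, short_end, fold in map_regulatory_intervals(example):
    print(f"deleting -{long_end}..-{short_end} changes activity {fold:.2f}-fold")
```

A drop in activity when an interval is removed suggests a positively acting site within it, and a rise suggests a
negatively acting site; either result would still need confirmation by targeted mutagenesis.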

The structure of a reporter gene construct containing the cytoplasmic form of chitobiase will vary according
to its purposes. When the reporter gene construct is a vector, one must decide between a vector that incorporates
the reporter gene into the host's genome or one that replicates extrachromosomally, such as a plasmid. When the
reporter construct is not designed to integrate into the host genome, the vector can contain an origin of replication
with activity in the host cell of interest. This feature provides the reporter gene vector the ability to replicate within
the host cell in which it has been introduced. In addition to the origin of replication, reporter constructs often contain
a promoter, a multiple cloning site, a selectable marker, and of course a reporter gene. Reporter constructs for use in
eukaryotic cells may also contain a polyA site adjacent to the reporter gene.

Given the utility of reporter gene constructs, it is not surprising that a number of cytoplasmic reporter gene
constructs and different reporter genes are available for use by those of skill in the art. For example, the cytoplasmic
reporter enzymes chloramphenicol acetyltransferase (CAT), firefly luciferase, β-glucuronidase (GUS), green fluorescent
protein (GFP), and β-galactosidase have been used extensively. However, such reporters all have individual
shortcomings that may limit or preclude their usage under some conditions. For example, high levels of GFP are toxic
to the cell. In addition, reporter enzymes are not expressed equally in all cell types nor are they equally stable when
expressed in all cell types. Furthermore, there is a recognized need for multiple reporter enzymes that can be assayed
independently of one another in order to simultaneously study the regulation of multiple genes within a single cell type.
Therefore, there exists a continuing need to identify reporter enzymes with useful properties.

The cytoplasmic enzyme β-galactosidase is widely used as a reporter gene in various microbiological and
molecular biological studies. This enzyme is used in both in vitro and in vivo assays. The wide acceptance of this
reporter system results, in part, because it is nonisotopic and extremely flexible. It is used in a number of assay
formats and has an extremely broad linear range. Nevertheless, because β-galactosidase is present in the cytoplasm
of various host cells such as Escherichia coli, deletion of the lacZ gene, the source of the enzyme, is often required
prior to its use in a host cell system. One goal of the present invention was to provide an alternative intracellular
enzyme for use as a reporter.

An extensive discussion of various molecular biology techniques is available in Ausubel, et al., (eds) "Short
Protocols in Molecular Biology," Wiley and Sons, Inc., New York (1997). Examples of such techniques include
isolating and preparing DNA for manipulation, gel electrophoresis, polymerase chain reaction (PCR), determining
nucleic acid sequences, screening nucleic acid libraries, mutagenesis of DNA, and introducing DNA into host cells.

The structure of a reporter gene construct will vary according to its purposes. The reporter gene constructs 
are constructed according to standard techniques of molecular biology well known in the art. 

When the reporter gene construct is a vector, one must decide between a vector that incorporates the
reporter gene into the host's genome or one that replicates extrachromosomally, such as a plasmid. When integration
of the reporter gene is a desired result, a reporter gene construct will contain sequences that will facilitate
incorporation.

One example of integration sequences that can be included in a reporter gene construct is the λ attP site.
This site permits a single copy of the reporter gene to be incorporated into a host bacterial genome.
Integration-promoting sequences with utility in mammalian cells include the long terminal repeats found in retroviral
genomes. These sequences promote viral genome integration in a host genome and have been used extensively by those
of skill in the art to promote the integration of exogenous sequences in mammalian host cells.

When the reporter construct is not designed to integrate into the host genome, it is common that the vector
contain an origin of replication with activity in the host cell of interest. This feature provides the reporter gene vector
the ability to replicate within the host cell in which it has been introduced.

In addition to the origin of replication, reporter constructs will often contain additional features that promote
the expression of the nucleic acid sequence or sequences contained in the construct. These additional sequences can
include a polyA site, a multiple cloning site, a drug resistance marker, and of course a reporter gene.

The present invention relates to the use of chitobiase as a reporter gene. The chitobiase may be used as a
reporter in bacteria, plants, mammalian cells and other host cell lines. One possible alternative to using
β-galactosidase as a reporter gene was to develop cytoplasmic N,N'-diacetylchitobiase (N-acetyl-β-D-glucosaminidase,
EC 3.2.1.30) for use as a reporter enzyme. One advantage of the enzyme N-acetyl-β-D-glucosaminidase or
"chitobiase" over β-galactosidase is that genes encoding chitobiase are missing from many bacteria, including E. coli,
some fungi, and some eukaryotic cells. Thus, it is not necessary to engineer many host cells to lack reporter activity
as is the case with β-galactosidase. The present invention also relates to various protein expression vectors that can
be used to express the reporter gene. In addition, the present invention may be used in conjunction with other reporter
enzymes in systems in which the regulation or activities of multiple genes are to be studied simultaneously.

Chitobiase is one of two enzymes that hydrolyze chitin, an abundant insoluble polysaccharide, to its
monomeric unit, N-acetylglucosamine (GlcNAc). Chitobiase is known to be present in a number of organisms. For
example, the chitobiase enzyme is known to be present in various genera including Arabidopsis, Bacillus, Bombyx, Bos,
Caenorhabditis, Candida, Dictyostelium, Entamoeba, Felis, Homo, Korat, Lactobacillus, Leishmania, Mus, Pisum,
Porphyromonas, Pseudoalteromonas, Rattus, Serratia, Streptomyces, Sus, Trichoderma, and Vibrio. Specific examples
of organisms known to contain chitobiase include Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx
mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba histolytica, Felis
catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces plicatus,
Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio
parahaemolyticus, and Vibrio vulnificus.

One source of the enzyme is the marine bacterium Vibrio harveyi. Escherichia coli cells harboring a plasmid
carrying the chb gene from Vibrio harveyi were reported to produce the enzyme, which was found to be associated
with the outer membrane of the bacterial cells. These are described in Jannatipour, M. et al., "Translocation of Vibrio
harveyi N,N'-diacetylchitobiase to the outer membrane of Escherichia coli," J. Bacteriol. 169:3785-3791 (1987) and
Soto-Gil & Zyskind, "N,N'-Diacetylchitobiase of Vibrio harveyi: primary structure, processing, and evolutionary
relationships," J. Biol. Chem. 264:14778-14782 (1989).

The present invention contemplates expressed cytoplasmic forms of chitobiase in various forms. In one
embodiment, the signal sequence is deleted from the amino terminal portion of the protein. Presumably the removal of
this sequence results in the expression of a cytoplasmic form of the enzyme that is not secreted from the host cell or
incorporated into the membrane of the host cell producing the enzyme.

The present invention also contemplates the generation of fusion proteins comprising a fusion polypeptide
joined in frame to chitobiase. Preferably, the fusion polypeptide comprises a polypeptide other than chitobiase, such
as a heterologous protein. The heterologous polypeptide may comprise a polypeptide having a biological activity (such
as an enzymatic or other activity besides activity as an immunogen) or the heterologous polypeptide may not have a
biological activity. The heterologous polypeptide does not include a signal sequence which directs its secretion.
Preferably the heterologous polypeptide is not β-galactosidase or a portion thereof. Thus, the fusion reporter gene
construct contains a sequence encoding the fusion polypeptide genetically fused in frame with a sequence encoding
chitobiase. In one embodiment this fusion may remove the amino-terminal signal peptide sequence of chitobiase and
replace it with a heterologous protein.

In another embodiment the fusion protein construct comprises a chitobiase gene sequence that has been
truncated to remove at least the signal peptide sequence of the gene. Alternatively, mutations may be introduced into
the signal peptide sequence so that it is no longer functional. Such mutations may be introduced using a variety of
techniques familiar to those skilled in the art including site directed mutagenesis, cassette mutagenesis, and chemical
mutagenesis.
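
The in-frame requirement for signal-sequence removal and for fusion to a heterologous polypeptide can be checked
at the sequence level before any cloning is attempted. The Python sketch below is illustrative only: the function
names, the toy coding sequence, and the number of signal codons are hypothetical, and the real length of the
chitobiase signal peptide must be taken from the published chb sequence, which is not reproduced here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq: str) -> list[str]:
    """Split a coding sequence into codons, rejecting out-of-frame lengths."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def cytoplasmic_fusion(chb_cds: str, signal_codons: int, partner_cds: str = "ATG") -> str:
    """Replace the first `signal_codons` codons of chb_cds with partner_cds, in frame.

    partner_cds must begin with ATG and contain no stop codon. The default,
    a bare ATG, models a simple signal-sequence deletion.
    """
    chb = codons(chb_cds.upper())
    partner = codons(partner_cds.upper())
    if not partner or partner[0] != "ATG":
        raise ValueError("partner must begin with an ATG start codon")
    if any(c in STOP_CODONS for c in partner):
        raise ValueError("partner contains a stop codon")
    if not 0 < signal_codons < len(chb):
        raise ValueError("signal_codons is out of range")
    mature = chb[signal_codons:]
    # A stop codon anywhere before the final codon would truncate the reporter.
    if any(c in STOP_CODONS for c in mature[:-1]):
        raise ValueError("premature stop codon in the retained chitobiase region")
    return "".join(partner + mature)


# Toy 8-codon sequence standing in for chb; the first 4 codons play the signal peptide.
toy_chb = "ATGAAACTGCTGGCTGCAGGTTAA"
print(cytoplasmic_fusion(toy_chb, 4))               # signal-sequence deletion
print(cytoplasmic_fusion(toy_chb, 4, "ATGACCATG"))  # fusion to a 3-codon partner
```

Working in whole codons keeps the reading frame intact by construction, so the only remaining checks are the start
codon and the absence of premature stop codons.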

Once the reporter gene construct is made it is introduced into a host cell line for testing. Host cells of
prokaryotic and eukaryotic origin can be used with the reporter gene constructs of the present invention. A variety of
methods are available to introduce reporter gene constructs into host prokaryotic cells. For example, bacteria can be
transformed using calcium chloride, electroporation, or viral vectors such as the filamentous phages. These and other
prokaryotic transformation protocols are well known in the art.

Alternatively, the sequence encoding chitobiase may be introduced in eukaryotic cells, including yeast,
mammalian, plant, and insect cells. For example, the sequence encoding chitobiase may be inserted into a yeast
artificial chromosome, a yeast plasmid, a bovine papilloma virus vector or other extrachromosomal vector, a retroviral
vector, a Ti-plasmid, or a baculovirus vector. A variety of such vectors are known to those skilled in the art. The
vectors may be introduced into any of the yeast, mammalian, plant, and insect cells familiar to those skilled in the art.

The introduction of the reporter gene construct into mammalian cells can likewise utilize a number of
transfection protocols well known to those of skill in the art. As discussed above, transfections can be transient or
stable. Examples of suitable transfer protocols include calcium phosphate transfection, DEAE-Dextran,
electroporation, liposome-mediated transfection, and viral transfection. These and other eukaryotic transformation
protocols are well known in the art.

Following introduction of the reporter gene construct into the host cell of interest, the enzymatic activity of
the reporter gene is measured. Preferably, the chitobiase assays are performed after permeabilizing or lysing the host
cells. There are a variety of cell permeabilization and cell lysis procedures available to those of ordinary skill in the
art, including methods such as sonication or lysozyme treatment. One embodiment of the present invention uses
toluene treatment to permeabilize cells. The details of this method are discussed in D.Y. Kalabat et al., BioTechniques
25:1030-1035 (1998) and Miller, J.H., A Short Course in Bacterial Genetics, CSH Laboratory Press, Cold Spring
Harbor, NY (1992).

Cellular chitobiase activity can be measured quantitatively by following the hydrolysis of chitobiase
substrates. Examples of substrates with utility in chitobiase activity assays include N,N'-diacetylchitobiose
(chitobiose), p-nitrophenyl-N-acetyl-β-D-glucosaminide (PNAG) (Sigma Chemical, St. Louis, MO), and
5-bromo-4-chloro-3-indolyl-N-acetyl-β-D-glucosaminide (X-Gluc) (Sigma Chemical, St. Louis, MO). Other substrates are
also contemplated for use in the assays of the present invention.

Products produced by the hydrolysis of the chitobiase substrates are monitored using various means familiar
to those skilled in the art. For example, various optical means are known to those skilled in the art. One such optical
means may comprise detection of chemiluminescent or fluorescent products released from a substrate. Alternatively,
the level of chitobiase activity may be determined by measuring the amount of light absorbed by a product produced
from a substrate or measuring a decrease in the amount of a detectable substrate. In one embodiment p-nitrophenol is
released from the substrate and measured at 400 nm. Other monitoring methods well known in the art can be used to
quantitate signals produced in the chitobiase assay. These may include use of radioactive substrates or substrates
having radiofrequency tags. In another embodiment, blue/white colony indicator plates are used to monitor enzyme
activity.
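
Absorbance readings at 400 nm are easier to compare between cultures once they are normalized for reaction time,
sample volume, and cell density. The text above does not define an activity unit, so the Python sketch below simply
assumes a normalization modeled on the Miller units commonly used for β-galactosidase; the function name, parameters,
and example readings are hypothetical.

```python
def chitobiase_units(a400: float, minutes: float, culture_ml: float,
                     od_culture: float, blank_a400: float = 0.0) -> float:
    """Return a Miller-style activity value from a p-nitrophenol reading.

    Assumed definition (not taken from the source text):
        units = 1000 * (A400 - blank) / (minutes * culture volume in ml * culture OD)
    """
    if minutes <= 0 or culture_ml <= 0 or od_culture <= 0:
        raise ValueError("time, volume, and culture density must be positive")
    return 1000.0 * (a400 - blank_a400) / (minutes * culture_ml * od_culture)


# Made-up readings: a reporter culture and a culture lacking the chb gene.
reporter = chitobiase_units(a400=0.50, minutes=10, culture_ml=0.1, od_culture=0.5)
background = chitobiase_units(a400=0.02, minutes=10, culture_ml=0.1, od_culture=0.5)
print(f"reporter: {reporter:.0f} units, background: {background:.0f} units")
```

Because the value scales linearly with absorbance, it is only meaningful while the reading stays within the linear
range of the spectrophotometer and the substrate remains in excess.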

Another embodiment of the present invention is a kit. One aspect of this embodiment includes a reporter
gene construct comprising a vector containing a chitobiase reporter gene. The reporter gene construct also contains a
multiple cloning site containing a variety of restriction endonuclease cutting sites that facilitate the introduction of
exogenous DNA into the construct.

The kit embodiment of the present invention also includes those components necessary to assay for
chitobiase activity produced by the reporter gene construct. For example, in one embodiment, the kit will include a
supply of a suitable chitobiase substrate whose metabolism into product by the reporter enzyme can be assayed.

EXAMPLES 

The following Examples are disclosed to assist in the understanding of the present invention. The Examples
below should not be construed to limit the scope of the invention, and such variations of the invention, now known or
later developed, which would be within the purview of one of ordinary skill in the art are considered to fall within
the scope of the invention hereinafter claimed.

Example 1 
Construction of Vectors 

Vectors were constructed using polymerase chain reaction (PCR) products that were cloned first at the SmaI
or EcoRV restriction sites of plasmid pBluescript II (pKSII+) (Stratagene; San Diego, CA). The nucleotide sequence of
all PCR products was determined using standard techniques well known in the art. Unmethylated plasmid DNA was
isolated from an E. coli dam strain when cutting with the BclI enzyme was required.

The pDYK9 (SEQ ID NO: 11) plasmid was constructed by ligating an SphI/KpnI PCR product containing the
dnaA promoter region into plasmid pRSG196 that contains the V. harveyi chitobiase gene (see Jannatipour, M., et al.).
Briefly, the V. harveyi chitobiase gene was prepared by cloning a Sau3A partial digest of V. harveyi DNA into the



WO 01/27322 PCT/US00/21048
single BamHI site within the tet gene of pMK2004 [See Soto-Gil & Zyskind, "Cloning of Vibrio harveyi chitinase and
chitobiase genes in Escherichia coli," pp. 209-223, In J.P. Zikakis (ed.), Chitin, chitosan, and related enzymes,
Academic Press, Inc., New York (1984)]. The clones containing the gene of interest were detected by the presence of
the yellow p-nitrophenolate product after individual colonies of the clone bank were sprayed with 10 mM PNAG
dissolved in 100 mM sodium phosphate, pH 7.0. Restriction maps of the positive clones were made according to
standard techniques well known in the art.

The chb gene was subcloned in a 3.5-kb EcoRI fragment from pRSG14 [See Jannatipour, M., et al.,
"Translocation of Vibrio harveyi N,N'-diacetylchitobiase to the outer membrane of Escherichia coli," J. Bacteriol.
169:3785-3791 (1987)] into the EcoRI site of pUC19 [Yanisch-Perron, C., et al., Gene 33:103-119 (1985)]. Plasmid
pRSG196 was constructed by deleting a 0.5-kb SstI fragment from one of these clones [See Jannatipour, M., et al.,
"Translocation of Vibrio harveyi N,N'-diacetylchitobiase to the outer membrane of Escherichia coli," J. Bacteriol.
169:3785-3791 (1987)].

The pRSG196 plasmid was also cut with SphI and KpnI to accommodate the insertion of the dnaA fragment.
Plasmid pAC17 [described in Chiaramello & Zyskind, "Coupling of DNA replication to growth rate in Escherichia coli: a
possible role for guanosine tetraphosphate," J. Bacteriol. 172:2013-2019 (1992)] served as template with primers
5'-GCA CAT GCA TGC TGG TCA TTA AAT TTT CC-3' (SEQ ID NO 1) and 5'-CGG GGT ACC AAC TCA TCC TGC AAT CG-3'
(SEQ ID NO 2), producing a PCR product 374 bp long that contains 353 bp from the dnaA promoter region [bases
583 to 935, numbering according to Hansen, F. G., et al., "The nucleotide sequence of the dnaA gene promoter and of
the adjacent rpmH gene, coding for the ribosomal protein L34, of Escherichia coli," EMBO J. 1:1043-1048 (1982)].
The forward primer (SphI primer II) contained an SphI site and the reverse primer (KpnI primer) contained a KpnI site
for cloning. This created an in-frame fusion between the amino-terminal 17 amino acids of DnaA and the carboxy-terminal
end of chitobiase deleted for the amino-terminal 22 amino acids including the signal peptide.
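
The primer design described above can be checked with a short script. The following is a minimal sketch, not part of the disclosed method: it confirms that the printed primers carry the SphI (GCATGC) and KpnI (GGTACC) recognition sequences and adds up the expected product length. It assumes, which the text does not state, that every primer base from the 5' end through the restriction site is a non-templated tail. The function names are hypothetical.

```python
# Recognition sequences for the two enzymes named in the text.
SITES = {"SphI": "GCATGC", "KpnI": "GGTACC"}

# Primer sequences as printed in the text (spacing is removed by clean()).
FORWARD = "GCA CAT GCA TGC TGG TCA TTA AAT TTT CC"  # SEQ ID NO 1
REVERSE = "CGG GGT ACC AAC TCA TCC TGC AAT CG"      # SEQ ID NO 2


def clean(seq: str) -> str:
    """Strip spacing and normalize case."""
    return seq.replace(" ", "").upper()


def tail_length(primer: str, site: str) -> int:
    """Count bases from the primer 5' end through the end of the site.

    Assumption (not stated in the text): these bases form a
    non-templated tail added for cloning.
    """
    start = clean(primer).find(site)
    if start < 0:
        raise ValueError(f"site {site} not found in primer")
    return start + len(site)


def product_length(first_base: int, last_base: int,
                   fwd_tail: int, rev_tail: int) -> int:
    """Templated span (inclusive numbering) plus both primer tails."""
    return (last_base - first_base + 1) + fwd_tail + rev_tail


fwd_tail = tail_length(FORWARD, SITES["SphI"])
rev_tail = tail_length(REVERSE, SITES["KpnI"])
total = product_length(583, 935, fwd_tail, rev_tail)
print(fwd_tail, rev_tail, total)
```

Under these assumptions the templated span of bases 583 to 935 is 353 bp and the tails add 12 and 9 bases, giving 374 bp, in agreement with the product length stated above.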

The next step in constructing the pDYK9 vector was taking the 3270 bp DraI/HindIII (partial digest)
fragment containing the dnaA-chb fusion and ligating it to the pACYC184 2555 bp HincII-HindIII fragment carrying
chloramphenicol resistance (Cm^r) and the P15A origin. This fragment was described in Chang & Cohen, "Construction
and characterization of amplifiable multicopy DNA cloning vehicles derived from the P15A cryptic miniplasmid," J.
Bacteriol. 134:1141-1156 (1987).

A NotI site was introduced at the AccI site after digestion with AccI, treatment with mung bean nuclease,
and ligation to phosphorylated NotI linkers (New England Biolabs, Inc.; Beverly, MA). A NotI site was introduced at
the AseI site after digestion with AseI, treatment with mung bean nuclease, and ligation to phosphorylated NotI
linkers (New England Biolabs, Inc.; Beverly, MA). An XbaI/SphI PCR product containing the rrnBt1t2 terminator was
ligated into this plasmid cut with the same enzymes, creating pDYK7. E. coli chromosomal DNA served as template
with primers 5'-CTA GTC TAG ATG CCG AAC TCA GAA GTG A-3' (SEQ ID NO 3) and 5'-GCA CAT GCA TGC GGG
GGA TGG CTT GTA GAT-3' (SEQ ID NO 4) to produce a PCR product 357 bp long that contains bases 6534 to 6869
from the rrnB operon [numbering according to Brosius, J., et al., "Gene organization and primary structure of a
ribosomal RNA operon from Escherichia coli," J. Mol. Biol. 148:107-127 (1981)] and includes the complex
transcription termination region of this operon, which is described in Orosz, A., et al., "Analysis of the complex
transcription termination region of the Escherichia coli rrnB gene," Eur. J. Biochem. 207:653-659 (1991). The
forward primer contained an XbaI site and the reverse primer contained an SphI site.

A BclI/SmaI PCR product containing the λ attP site was ligated into pDYK7 cut with Tth111I, treated with
mung bean nuclease, then digested with BclI. Plasmid pHN894 [described in Goodman, S. D., et al., "Deformation of
DNA during site-specific recombination of bacteriophage lambda: Replacement of IHF protein by HU protein or
sequence-directed bends," Proc. Natl. Acad. Sci. USA 89:11910-11914 (1992)] served as template with the forward
primer 5'-CAT GAT CAT GCG ACA GGT TTG ATG A-3' (SEQ ID NO 5) and the reverse primer 5'-GGG GGC GCC TAC
CTT TCA CGA G-3' (SEQ ID NO 6), producing a PCR product 466 bp long that contains the λ attP site. The 466 bp
PCR product containing the λ attP site was first cloned into the SmaI site of the pKSII+ plasmid to produce pDYK8.
The forward primer contains a BclI site and the reverse primer contains G's at the 5' end in order to recreate a SmaI
site when cloning the PCR fragment into a SmaI site. This PCR product includes bases -211 to +241 from the center
of the attP core and the sequence required for optimum λ attP site integration as described in Nagaraja & Weisberg,
"Specificity determinants in the attachment sites of bacteriophages HK022 and λ," J. Bacteriol. 172:6540-6550
(1990). The orientation of attP is such that when the fusion is integrated at attB, the transcription direction of the
dnaAp1 and p2 promoters is the same as replication fork movement, mimicking the orientation at the wild type dnaA
promoters.

Vector pDYK11 (SEQ ID NO: 12) was constructed by ligating an SphI/KpnI PCR product containing the rpmH-dnaA
promoter region into pDYK9 also digested with SphI and KpnI. Plasmid pAC17, described in Chiaramello &
Zyskind (1992), served as template with the primers 5'-CAT GCA TGC ATG AAA CGA TGG ACA CC-3' (SEQ ID NO 7)
and 5'-CGG GGT ACC AAC TCA TCC TGC AAT CG-3' (SEQ ID NO 8) to produce a PCR product 616 bp long that
contains 598 bp from the rpmH-dnaA regulatory region [bases 338 to 935, numbering according to Hansen, F. G., et
al. (1982)]. The forward primer (SphI primer I) contained an SphI site and the reverse primer (KpnI primer) contained a
KpnI site for cloning. SEQ ID NO: 18 provides the complete coding sequence of the DNA encoding the dnaA/chitobiase
fusion protein. The complete sequence of the dnaA/chitobiase fusion protein is provided in SEQ ID NO: 19.

The vector pJMF3 (SEQ ID NO: 13) was constructed by first ligating an AseI linker (5'-CATTAATGCATG-3'
(SEQ ID NO 9), self-hybridized) into the SphI site of pDYK11. The pUC19 [described in Yanisch-Perron, C., et al.,
"Improved M13 phage cloning vectors and host strains: nucleotide sequences of the M13mp18 and pUC19 vectors,"
Gene 33:103-119 (1985)] AseI/KpnI fragment containing the lacZ-polylinker region was ligated to this plasmid after
digestion with AseI and KpnI. The resulting in-frame fusion between the amino-terminal 21 amino acids of the pUC19
lacZ(α) peptide [Yanisch-Perron, C., et al., 1985] and the carboxy-terminal end of chitobiase deleted for the
amino-terminal 22 amino acids is identical to the protein fusion in pRSG196 [Jannatipour, M., et al. (1995)].

Vector pJMF4 (SEQ ID NO: 14) was constructed by ligating the BamHI/EcoRV fragment of pDYK8
containing the λ attP site into pDYK7, which had been digested with Tth111I, treated with mung bean nuclease, then
digested with BclI, to create pTKP9. This reversed the orientation of the attP site relative to the attP site in pDYK9.
The pDYK11 BspMI/KpnI fragment containing the rpmH-dnaA promoter region was then ligated into pTKP9 digested
with the same enzymes to create pTKP11. This reversed the orientation of the attP site relative to the attP site in
pDYK11. An AseI linker (5'-CATTAATGCATG-3' (SEQ ID NO 10), self-hybridized) was ligated into the SphI site of
pTKP11 to create pJMF2. The pUC19 [as described in Yanisch-Perron, C., et al. (1985)] AseI/KpnI fragment
containing the lacZ-polylinker region was ligated to pJMF2 cut with AseI and KpnI to create pJMF4.

Example 2 
Site-specific Recombination 
To move the chitobiase fusions in pDYK9 and pDYK11 to the attB site in the chromosome, NotI fragments
from these plasmids were self-ligated and transformed or electroporated into strain WM2269 (DH5α containing
pLDR8) [See Zyskind, J. W. and S. I. Bernstein (1992) "Recombinant DNA Laboratory Manual," Academic Press, San
Diego, CA]. Plasmid pLDR8, described in Diederich, L., et al., "New cloning vectors for integration into the λ
attachment site attB of Escherichia coli chromosome," Plasmid 28:14-24 (1992), expresses integrase from the λ PR
promoter and contains the λ cI857 repressor gene, a kanamycin resistance gene, and a temperature-sensitive origin of
replication. The transformed or electroporated cells were incubated at 42°C with shaking for 30 min, then moved to
37°C for 1 h, followed by selection on Luria broth agar plates containing chloramphenicol (25 μg/ml) at 42°C.
Transformants were screened for loss of kanamycin resistance and, therefore, loss of pLDR8.

Example 3 
Bacteriophage P1 Transduction

Transduction with P1 bacteriophage by the method of Zyskind & Bernstein, Recombinant DNA Laboratory
Manual, Academic Press, San Diego, CA (1992) was used to construct strains and to confirm the chromosomal
location of the dnaA-chb fusions. Cotransduction of Cm^r (carried by the fusion) and galK (linked to attB) indicated
that Cm^r and galK are linked on the chromosomes of strains DYK9W, DYK9F, DYK11W, and DYK11F.

Example 4 

p-Nitrophenol Chitobiase Activity Assay

Chitobiase activity is located in the cytoplasm when its signal peptide is replaced by fusion with another
peptide. Accordingly, chitobiase assays are performed on permeabilized or lysed cells. A variety of permeabilization or
cell lysis protocols are available to liberate the enzyme from within a cell population being tested. One such protocol
involves toluene-treated cells washed once with M9 salts according to the method of Miller, J. H., A Short Course in
Bacterial Genetics, CSH Laboratory Press, Cold Spring Harbor, NY (1992). The toluenized cells are placed in a
chitobiase buffer (10 mM Tris-HCl, pH 8.0, and 0.5 M NaCl) and 666 μM PNAG, the chitobiase enzyme substrate.
NaCl is included because chitobiase has approximately 80% of full activity in the absence of salt, with maximal
activity occurring between 0.25 and 0.6 M NaCl. Toluenized cells (0.772 ml) are preincubated at 28°C and the
reaction started with the addition of 0.228 ml PNAG (1 mg/ml). After incubation at 28°C, the reaction is stopped by
the addition of 1 ml of 1 M Tris base. The release of p-nitrophenol is measured at 400 nm and turbidity at 550 nm.
p-Nitrophenol release is measured immediately at 400 nm with a molar absorptivity of 1.8 x 10^3 liters mol^-1 cm^-1. Units
are calculated after subtracting the light scattering factor (1.5 x OD550) from the OD400 of the sample. The normalizing
factor of 1.5 was determined previously by measuring the light scattering ratio of bacteria at OD400 and OD550. One
unit of chitobiase activity is the amount of enzyme that catalyzes the formation of 1 pmol of p-nitrophenol per min at
28°C. For comparison to Miller units of β-galactosidase [described in Miller, J. H., A Short Course in Bacterial
Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1992)], the units are normalized to 1 ml of
culture at OD450 = 1.
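
The arithmetic of this assay can be sketched in a few lines. The following is an illustrative calculation under stated assumptions, not the exact procedure used to generate the data in the Examples. It assumes a PNAG molecular weight of 342.3 g/mol, a 1 cm light path, a final volume of 2 ml (1 ml of reaction plus 1 ml of Tris base), and that the culture volume represented in the assay is supplied by the user. The molar absorptivity default is the value printed in the text. All function and parameter names are hypothetical.

```python
# Assumption: molecular weight of p-nitrophenyl-N-acetyl-beta-D-glucosaminide.
PNAG_MW_G_PER_MOL = 342.3


def pnag_final_micromolar(stock_mg_per_ml=1.0, stock_ml=0.228, total_ml=1.0):
    """Final PNAG concentration (micromolar) in the 1 ml reaction."""
    grams_per_liter = stock_mg_per_ml * stock_ml / total_ml  # mg/ml equals g/L
    return grams_per_liter / PNAG_MW_G_PER_MOL * 1e6


def chitobiase_units(od400, od550, minutes, culture_ml, od450,
                     epsilon=1.8e3, path_cm=1.0, final_volume_ml=2.0):
    """pmol p-nitrophenol per min, normalized to 1 ml of culture at OD450 = 1.

    epsilon is the molar absorptivity as printed in the text; path_cm and
    final_volume_ml are assumptions of this sketch.
    """
    corrected = od400 - 1.5 * od550                 # remove light scattering
    mol_per_liter = corrected / (epsilon * path_cm)  # Beer-Lambert law
    pmol = mol_per_liter * (final_volume_ml / 1000.0) * 1e12
    return pmol / minutes / (culture_ml * od450)


print(round(pnag_final_micromolar(), 1))
print(chitobiase_units(od400=0.33, od550=0.10, minutes=10,
                       culture_ml=0.772, od450=0.3))
```

With the assumed molecular weight, 0.228 ml of a 1 mg/ml stock in a 1 ml reaction comes to roughly 666 μM, which agrees with the substrate concentration stated above.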

Example 5 

Determination of Chitobiase Activity in Cells Containing Vectors

pJMF3 and pJMF4 (Figure 1) contain the lacPO promoter with the first 21 amino acids of lacZα (from pUC19) fused in-frame to
the chb gene. These plasmids also contain the λ phage attP recombination site in different orientations, the gene
encoding chloramphenicol acetyltransferase (cat), and a ribosomal terminator, rrnBt1t2, inserted upstream of the lac-chb
fusion to prevent read-through from other promoters. The chitobiase activity associated with these plasmids
(Table 1) is high in the absence of IPTG because of titration of lac repressor expressed from a single copy
chromosomal gene. Induction by IPTG is approximately 10-fold (Table 1).

Table 1. Chitobiase Activity of lacZ-chb Fusions(a)

                     Chitobiase(b) (U(c))
Plasmid              -IPTG         +IPTG (1 mM)
pJMF3 in DH5α        668 ± 45      9320 ± 347
pJMF4 in DH5α        788 ± 44      7188 ± 477

(a) Overnight cultures were diluted 1:1000 into 50 mL prewarmed LB and grown to OD450 = 0.1. 1 mM IPTG was
added to half of the culture, and growth continued to OD450 = 0.3.
(b) Triplicate samples were assayed. Mean chitobiase activities are given with standard deviations.
(c) One unit of chitobiase activity is 1 pmol of p-nitrophenol/min at 28°C. Units given for 1 mL of culture at
OD450 = 1.

The lac promoter can be replaced with another promoter and a fusion protein created with chitobiase by
cutting with SphI or AseI and either SalI, KpnI, or SstI (Figure 2). Fusions created with these vectors can be moved to
the chromosome by site-specific recombination at the λ attB site to permit single copy analysis of the activity of the
promoter. The protocol, as described more fully in Diederich, et al., "New cloning vectors for integration into the λ
attachment site attB of Escherichia coli chromosome," Plasmid 28:14-24 (1992), involves two components: (i) a
circular DNA containing the λ attachment site, attP, the promoter-gene fusion, and the cat gene, and (ii) a helper
plasmid, pLDR8, which contains the int gene under the control of the temperature-sensitive repressor, cI857, and a
temperature-sensitive origin of replication. The plasmid is digested with NotI to remove the P15A origin, and the
fragment containing the chitobiase fusion is self-ligated prior to transformation into cells containing pLDR8.
Integration occurs by site-specific recombination between attP and attB (17.4 min on the E. coli chromosome,
Figure 3).

Example 6 

Use of the Chitobiase Reporter Enzyme to Study dnaA Gene Regulation

Two plasmids, pDYK9 and pDYK11, discussed in the Examples above, were constructed to assess the
regulation of the dnaA gene using chitobiase as a reporter enzyme. These plasmids differ by the absence of the rpmH
regulatory region in pDYK9 (Figure 4). These fusions were moved from the plasmid to the chromosomal attB site for
single copy analysis as described above. After transformation of strain WM2269 with the ligated DNA, integration
occurred by site-specific recombination between the attP and the attB sites. The orientation of attP in pDYK9 and
pDYK11 is such that when the fusion is integrated at attB, the transcription direction of the dnaAp1 and dnaAp2
promoters is the same as movement of the replication fork. This orientation is the same as at the dnaA wild type
location. The genetic location of the fusions was confirmed in the Cm^r transformants by demonstrating
cotransduction of Cm^r and galK.

The fusions created in strain WM2269 were moved by P1 transduction to MG1655, creating strain DYK9W
with pDYK9 and strain DYK11W with pDYK11. Deletion of the rpmH promoters had very little effect (1.4-fold) on
chitobiase activity (Table 2, compare lines 1 and 3).

Table 2. Chitobiase and β-Gal Activities of dnaA-chb and dnaA-lacZ Fusions in WT and fis Mutant Backgrounds

Strains             Chitobiase(a) (U(b))     β-Gal(c) (Miller U(d))
DYK11W fis+         30.0 ± 2.1
DYK11F fis::985     80.0 ± [illegible]
DYK9W fis+          44.0 ± 2.1
DYK9F fis::985      96.0 ± 3.5
RB220 fis+                                   59.8 ± 7.9
TP220 fis::767                               115.2 ± 7.2

(a) Triplicate samples were assayed during exponential growth. Mean chitobiase activities are given with
standard deviation.
(b) One unit of chitobiase activity is 1 pmol of p-nitrophenol per min at 28°C. Units given for 1 mL of culture at
OD450 = 1.
(c) Data from Reference 8.
(d) Unit defined in Reference 13.

Fis protein binds to a site in the dnaAp2 promoter that covers the -35 sequence (Figure 5), and appears to be
a repressor of DnaA expression. A fusion protein with β-galactosidase activity that is expressed from the rpmH-dnaA
regulatory region has increased β-galactosidase activity (1.9-fold) in a fis mutant when compared to Fis wild type
cells [as described in Froelich, J. M., et al., "Fis binding in the dnaA operon promoter region," J. Bacteriol.
178:6006-6012 (1996), data shown in Table 3]. Similarly, the absence of Fis leads to a greater than 2-fold increase in
chitobiase activity of the DnaA-chitobiase fusion protein for the DYK9F and DYK11F strains, comparable in extent to
that observed with the dnaA-lacZ fusion strain, TP220 (Table 3).
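
The fold changes quoted in this Example follow directly from the mean activities tabulated above. The short sketch below recomputes them; the dictionary holds the mean values as printed in the table of chitobiase and β-Gal activities, and the names used are hypothetical.

```python
# Mean activities as printed in the table (chitobiase units or Miller units).
MEAN_ACTIVITY = {
    "DYK11W": 30.0,   # fis+, rpmH-dnaA-chb
    "DYK11F": 80.0,   # fis mutant, rpmH-dnaA-chb
    "DYK9W": 44.0,    # fis+, dnaA-chb without rpmH region
    "DYK9F": 96.0,    # fis mutant, dnaA-chb without rpmH region
    "RB220": 59.8,    # fis+, dnaA-lacZ
    "TP220": 115.2,   # fis mutant, dnaA-lacZ
}


def fold_change(numerator: str, denominator: str) -> float:
    """Ratio of mean activities for two strains."""
    return MEAN_ACTIVITY[numerator] / MEAN_ACTIVITY[denominator]


COMPARISONS = {
    "fis effect, DYK11": ("DYK11F", "DYK11W"),
    "fis effect, DYK9": ("DYK9F", "DYK9W"),
    "fis effect, lacZ fusion": ("TP220", "RB220"),
    "rpmH deletion effect": ("DYK9W", "DYK11W"),
}

for label, (num, den) in COMPARISONS.items():
    print(f"{label}: {fold_change(num, den):.2f}-fold")
```

The two chitobiase ratios for the fis mutants come out above 2, and the β-galactosidase ratio is close to 1.9, consistent with the text. The ratio for the rpmH deletion is about 1.47, which the text reports as 1.4-fold.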

In the reporter gene constructs discussed in the Example, all upstream transcriptional activity was prevented
from entering the chb reporter gene. The plasmid vectors, pJMF3 and pJMF4, described in the Examples above,
contained the rrnBt1t2 terminator upstream of the promoter fusion, which prevented readthrough from chromosomal
promoters near the insertion site. Only chitobiase activity originating from the promoters of interest was expressed.

The attP site in these vectors allowed integration at the chromosomal attB in a specific orientation, 
depending on the vector used. With these vectors, any chitobiase fusion involving an essential gene can be moved to 
the chromosome, thus permitting single copy analysis with a chromosomal orientation similar to the wild-type gene. 

Example 7

Identification of Promoters in Test Sequences 
A nucleic acid prospectively containing a promoter is inserted upstream of a nucleic acid encoding a cytoplasmic 
form of chitobiase as described above. For example, the nucleic acid prospectively containing a promoter may be inserted 
into a restriction site in a sequence containing a plurality of restriction sites, such as a polylinker, which is located 
upstream of the nucleic acid encoding chitobiase. The test sequence may comprise any nucleic acid to be tested for
promoter activity. In one embodiment, the test sequence may comprise a genomic DNA sequence. For example, the
genomic DNA sequence may be a randomly generated DNA fragment, such as a fragment generated using shotgun cloning 
techniques, a restriction fragment, or any other sequence. 

The vectors containing the test sequence upstream of the nucleic acid encoding chitobiase are introduced into an
appropriate host cell. The level of chitobiase activity is assayed and compared to the level obtained from a control vector
which lacks an insert in the cloning site. The presence of an elevated expression level in cells containing the vector
containing the insert with respect to the level in cells containing the control vector without the insert indicates the
presence of a promoter in the insert.
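
The comparison against the control vector can be sketched as a simple ratio test. This is only an illustration: the text does not specify how large an elevation must be, so the `min_fold` cutoff of 2.0 and the function name are hypothetical choices, and the activity values in the usage lines are invented for demonstration.

```python
from statistics import mean


def has_promoter(insert_units, control_units, min_fold=2.0):
    """Compare replicate chitobiase activities for insert and control vectors.

    Returns (call, ratio), where call is True when the mean activity with the
    insert is at least min_fold times the mean of the insert-less control.
    min_fold is an assumed cutoff, not a value taken from the text.
    """
    control_mean = mean(control_units)
    if control_mean <= 0:
        raise ValueError("control activity must be positive")
    ratio = mean(insert_units) / control_mean
    return ratio >= min_fold, ratio


# Invented replicate values, for demonstration only.
call, ratio = has_promoter([300.0, 320.0, 310.0], [30.0, 32.0, 28.0])
print(call, round(ratio, 2))
```

In practice the cutoff would be chosen with regard to the replicate variability of the control, for example by requiring the difference to exceed several standard deviations.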

In some embodiments, the activity of the promoter in the test sequence may be assayed after exposure of the 
host cells to conditions which may influence the level of transcription from the promoter. For example, the environment of
the host cells may be altered to determine whether the transcription level is influenced by environmental factors, including 
factors such as temperature, pH, nutrients, or availability of oxygen. In such analyses, chitobiase levels are assayed under 
a variety of environmental conditions to determine the effects of the environmental conditions on transcription levels from 
the promoter. In addition, the activity of the promoters may be examined in the presence or absence of compounds to be 
tested for regulatory activity. For example, the activity of the promoters may be tested by determining the levels of
chitobiase produced in the presence or absence of compounds to be tested for activity as drugs. 

Promoter sequences within the test sequences may be further defined by constructing nested deletions in the 
test sequences using conventional techniques such as Exonuclease III digestion. The resulting deletion fragments can be
inserted into the promoter reporter vector to determine whether the deletion has reduced or obliterated promoter activity
as determined by measuring chitobiase activity in cells containing the deletion vectors. In this way, the boundaries of the 
promoters may be defined. If desired, potential individual regulatory sites within the promoter may be identified using 
techniques such as site directed mutagenesis or linker scanning to obliterate potential transcription factor binding sites 
within the promoter individually or in combination. The effects of these mutations on transcription levels may be 
determined by inserting the mutations into the cloning sites in the promoter reporter vectors and measuring the levels of
chitobiase produced from the mutated promoters. 
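
The deletion analysis described above can be illustrated with a small sketch that brackets a promoter boundary from a nested deletion series. The data layout, the `retained_fraction` cutoff of 0.5, and the example values are all hypothetical; the text does not prescribe a particular threshold.

```python
def map_boundary(deletions, full_length_units, retained_fraction=0.5):
    """Bracket the promoter boundary from a nested deletion series.

    deletions: iterable of (bp_removed, chitobiase_units) pairs, where
    bp_removed counts bases deleted from the upstream end of the insert.
    Returns (last_active_endpoint, first_inactive_endpoint), or None when
    no deletion drops activity below the assumed cutoff.
    """
    last_active = 0  # the undeleted insert is active by definition
    for bp_removed, units in sorted(deletions):
        if units < retained_fraction * full_length_units:
            return (last_active, bp_removed)
        last_active = bp_removed
    return None


# Invented example series, for demonstration only.
series = [(50, 95.0), (100, 90.0), (150, 12.0), (200, 3.0)]
print(map_boundary(series, full_length_units=100.0))
```

For the invented series the boundary falls between the 100 bp and 150 bp deletion endpoints; finer deletions or site directed mutations within that interval would then narrow it further.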

The activity of known promoters may also be monitored by operably linking them to a nucleic acid encoding a 
cytoplasmic form of chitobiase. The activity of the promoters may be analyzed under various environmental conditions as 
described above. In addition, the activity of the promoters may be analyzed in the presence or absence of compounds to be 
tested for the ability to affect transcription from the promoters. For example, the compounds may be tested for activity as
drugs. 

In some embodiments, the chitobiase reporter constructs may be used in systems for identifying compounds that 
modulate cell surface protein-mediated activity or compounds which modulate the activities of intracellular signaling 
systems. Techniques for using reporter genes to identify compounds which modulate cell surface protein-mediated activity
have been described in U.S. Patent Number 5,401,629 and U.S. Patent Number 5,436,128. Briefly, in such methods, a
construct comprising a promoter operably linked to a nucleic acid encoding a reporter enzyme is introduced into cells which
express the cell surface protein and cells which do not express the cell surface protein. Each of the cells is contacted
with test compounds and the effects of these compounds on transcription levels are measured by determining the level of
activity of the reporter enzyme. The level of expression of the reporter gene in cells expressing the cell surface protein is
compared to the level in cells which do not express the cell surface protein to identify compounds that modulate cell
surface protein activity.

Similarly, the chitobiase reporter constructs may be used to identify compounds which influence the activity of 
intracellular signaling pathways, such as cAMP-based or phosphorylation-based pathways. In such methods, a promoter 
which is activated via such pathways is operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase. The 
cells are contacted with test compounds. Those compounds which activate the pathway to which the promoter responds
will produce an enhanced level of chitobiase activity in the cells as compared to the level of chitobiase activity in control 
cells which have not been contacted with the test compound. 
Example 8

Detecting Successful Transformation or Transfection Using Chitobiase
A vector comprising a sequence encoding a cytoplasmic form of chitobiase operably linked to a sequence
capable of directing transcription of the chitobiase gene is introduced into a host cell. The host cells are contacted
with a chitobiase substrate and those host cells which contain chitobiase activity are identified as cells which were
successfully transformed or transfected. In some embodiments, a portion or replica of a colony may be lysed or
permeabilized, and the lysate or permeabilized cells may then be contacted with the chitobiase substrate.

CONCLUSION 

New gene reporter systems that use chitobiase have been described. Chitobiase has advantages over other 
reporter gene systems in that chitobiase is not found in many cell lines traditionally used in reporter gene systems. 

Finally, the foregoing examples are not intended to limit the scope of the present invention, which is set forth
in the following claims. In particular, various equivalents and substitutions will be recognized by those of ordinary skill 
in the art in view of the foregoing disclosure, and these are contemplated to be within the scope of the present 
invention. 
WHAT IS CLAIMED IS:

1. A method for characterizing a promoter comprising:

providing a construct comprising said promoter operably linked to a nucleic acid encoding a
cytoplasmic form of chitobiase;
introducing the construct into host cells; and

identifying sequences in said promoter which regulate transcription levels.

2. The method of Claim 1, wherein said cytoplasmic form of chitobiase lacks a signal sequence.

3. The method of Claim 2, wherein said nucleic acid encoding a cytoplasmic form of chitobiase encodes a 
fusion protein, said fusion protein comprising a cytoplasmic form of chitobiase fused to a heterologous polypeptide. 

4. The method of Claim 1, wherein said nucleic acid encoding a cytoplasmic form encodes a cytoplasmic
form of chitobiase obtained from an organism selected from the group consisting of Alteromonas sp. O-7, Arabidopsis
thaliana, Bacillus subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium
discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania donovani,
Mus musculus, Pisum sativum, Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio
furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and Vibrio vulnificus.

5. The method of Claim 1, wherein said method of identifying sequences which are involved in directing
transcription comprises mutagenizing said promoter.

6. The method of Claim 1, wherein said method of identifying sequences which are involved in
transcription comprises constructing deletions in said promoter.

7. A method for identifying a regulatory element capable of directing or regulating transcription within a 
test nucleic acid sequence comprising: 

providing a construct comprising said test nucleic acid sequence operably linked to a nucleic acid 
encoding a cytoplasmic form of chitobiase; 
introducing said construct into host cells; and

determining the level of chitobiase activity. 

8. The method of Claim 7, wherein said cytoplasmic form of chitobiase lacks a signal sequence. 

9. The method of Claim 8, wherein said nucleic acid encoding a cytoplasmic form of chitobiase encodes a 
fusion protein, said fusion protein comprising a cytoplasmic form of chitobiase fused to a heterologous polypeptide. 

10. The method of Claim 7, wherein said nucleic acid encoding a cytoplasmic form encodes a cytoplasmic
form of chitobiase obtained from an organism selected from the group consisting of Alteromonas sp. O-7, Arabidopsis
thaliana, Bacillus subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium 
discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania donovani, 
Mus musculus, Pisum sativum, Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia 



marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio
furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and Vibrio vulnificus.

11. The method of Claim 7, wherein said reporter gene construct is introduced transiently.

12. The method of Claim 7, wherein said reporter gene construct is introduced stably. 

13. The method of Claim 7, wherein said host cells are selected from the group consisting of prokaryotic
cells and eukaryotic cells.

14. The method of Claim 7, further comprising permeabilizing or lysing said host cells. 

15. The method of Claim 14, wherein said permeabilizing or lysing step comprises treating said host cells 
with toluene. 

16. The method of Claim 7, wherein said step of determining the level of chitobiase activity is selected from
the group consisting of measuring the amount of a chemiluminescent product produced from a substrate, measuring
the amount of a fluorescent product produced from a substrate, measuring the amount of light absorbed by a product
produced from a substrate and measuring a decrease in the amount of a detectable substrate.

17. The method of Claim 7, wherein said step of determining the level of chitobiase activity comprises
determining the level of p-nitrophenol released from a substrate.

18. The method of Claim 7, wherein said test nucleic acid sequence comprises a portion of genomic DNA. 

19. The method of Claim 7, wherein said step of determining the level of chitobiase activity comprises 
determining the level of chitobiase activity after exposing said host cells to a desired set of environmental conditions.

20. The method of Claim 7, wherein said step of determining the level of chitobiase activity comprises 
determining the level of chitobiase activity after contacting said host cells with a compound to be tested for its
influence on the level of transcription from said regulatory element.

21. A method of detecting successful transformation, comprising the steps of: 

introducing a nucleic acid encoding a cytoplasmic form of chitobiase into host cells; and 
detecting chitobiase expression in said host cells. 

22. A fusion protein-reporter gene construct comprising a promoter operably linked to a nucleic acid
encoding a cytoplasmic form of chitobiase fused in frame with a nucleic acid encoding a heterologous polypeptide,
wherein said heterologous polypeptide is not β-galactosidase or a portion thereof and wherein said heterologous
polypeptide does not contain a signal peptide.

23. The nucleic acid of Claim 22, wherein said nucleic acid encodes a cytoplasmic form of chitobiase
obtained from an organism selected from the group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus
subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba
histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum
sativum, Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces
plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio
parahaemolyticus, and Vibrio vulnificus.
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24. The nucleic acid of Claim 22, further comprising a X site-specific recombination sequence. 

25. A reporter gene construct comprising plasmid pJMF3. 

26. A reporter gene construct comprising plasmid pJMF4.

27. A reporter gene construct comprising plasmid pDYK9.

28. A reporter gene construct comprising plasmid pDYK11.

29. A host cell comprising the construct of Claim 22. 

30. The host cell of Claim 29 wherein said nucleic acid is integrated into a chromosome of said cell. 

31. The host cell of Claim 29, wherein said nucleic acid is transiently expressed in said host cell. 

32. A nucleic acid encoding a cytoplasmic form of chitobiase in which the signal sequence of native
chitobiase has been inactivated or deleted.

33. The nucleic acid of Claim 32, wherein the signal sequence has been mutated to inactivate it. 

34. An isolated or purified polypeptide comprising a cytoplasmic form of chitobiase fused in frame with a
heterologous polypeptide, wherein said heterologous polypeptide is not β-galactosidase or a portion thereof and
wherein said heterologous polypeptide does not contain a signal peptide.

35. An isolated or purified polypeptide comprising a cytoplasmic form of chitobiase in which the signal
peptide of native chitobiase has been inactivated or deleted.

36. The polypeptide of Claim 35, wherein the signal sequence has been mutated to inactivate it. 

37. A method for monitoring the activity of a promoter comprising: 

providing a construct comprising said promoter operably linked to a nucleic acid encoding a
cytoplasmic form of chitobiase;

introducing said construct into host cells; and 
determining the level of chitobiase activity. 

38. The method of Claim 37, wherein said cytoplasmic form of chitobiase lacks a signal sequence. 

39. The method of Claim 38, wherein said nucleic acid encoding a cytoplasmic form of chitobiase encodes a
fusion protein, said fusion protein comprising a cytoplasmic form of chitobiase fused to a heterologous polypeptide.

40. The method of Claim 37, wherein said nucleic acid encoding a cytoplasmic form encodes a cytoplasmic
form of chitobiase obtained from an organism selected from the group consisting of Alteromonas sp. O-7, Arabidopsis
thaliana, Bacillus subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium
discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania donovani,
Mus musculus, Pisum sativum, Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio
furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and Vibrio vulnificus.

41. The method of Claim 37, wherein said reporter gene construct is introduced transiently. 

42. The method of Claim 37, wherein said reporter gene construct is introduced stably.

43. The method of Claim 37, wherein said host cells are selected from the group consisting of prokaryotic
cells and eukaryotic cells.

44. The method of Claim 37, further comprising permeabilizing or lysing said host cells. 

45. The method of Claim 44, wherein said permeabilizing or lysing step comprises treating said host cells 
with toluene. 

46. The method of Claim 37, wherein the step of determining the level of chitobiase activity is selected 
from the group consisting of measuring the amount of a chemiluminescent product produced from a substrate, 
measuring the amount of a fluorescent product produced from a substrate, measuring the amount of light absorbed by 
a product produced from a substrate and measuring a decrease in the amount of a detectable substrate. 

47. The method of Claim 37, wherein said step of determining the level of chitobiase activity comprises
determining the level of p-nitrophenol released from a substrate.

48. The method of Claim 37, wherein said step of determining the level of chitobiase activity comprises 
determining the level of chitobiase activity after exposing said host cells to a desired set of environmental conditions. 

49. The method of Claim 37, wherein said step of determining the level of chitobiase activity comprises 
determining the level of chitobiase activity after contacting said host cells with a compound to be tested for its 
influence on the level of transcription from said regulatory element.

50. The method of Claim 49, wherein said compound comprises a compound to be tested for activity as a
drug.
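
The activity determinations recited in Claims 46 to 49 can be illustrated with a short calculation. The sketch below is a minimal example and not the method of the invention: it assumes a Miller-style normalization (absorbance of released p-nitrophenol divided by reaction time, sample volume, and culture optical density), a convention borrowed from β-galactosidase assays that is not stated in the claims. The function names `chitobiase_units` and `fold_change` and all numeric inputs are hypothetical.

```python
def chitobiase_units(a_pnp, minutes, volume_ml, culture_od):
    """Return absorbance-based activity units (assumed Miller-style formula).

    a_pnp      -- absorbance of released p-nitrophenol, background subtracted
    minutes    -- reaction time in minutes
    volume_ml  -- volume of permeabilized culture assayed, in mL
    culture_od -- optical density of the culture (proxy for cell density)
    """
    if minutes <= 0 or volume_ml <= 0 or culture_od <= 0:
        raise ValueError("minutes, volume_ml and culture_od must be positive")
    return 1000.0 * a_pnp / (minutes * volume_ml * culture_od)


def fold_change(treated_units, control_units):
    """Ratio of activity with a test compound to activity without it."""
    if control_units <= 0:
        raise ValueError("control_units must be positive")
    return treated_units / control_units


# Hypothetical readings for one promoter-chitobiase fusion strain.
control = chitobiase_units(a_pnp=0.20, minutes=10, volume_ml=0.1, culture_od=0.5)
treated = chitobiase_units(a_pnp=0.60, minutes=10, volume_ml=0.1, culture_od=0.5)
print(control, treated, fold_change(treated, control))
```

Under these assumptions, a fold change above 1 would suggest that the tested compound or condition increases transcription from the promoter, and a value below 1 would suggest a decrease.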
SEQ ID NO: 15:

[Figure: annotated promoter region sequence with its encoded amino acids. Labels: CAP-cAMP binding site, -35, -10, +1, Lac repressor binding site, and restriction sites SphI, AseI, SalI, KpnI and SstI. The nucleotide sequence and translation are not fully recoverable from the source.]
[Figure 3: diagram of the pJMF3 NotI fragment (lacPOZ'-'chb, attP, cat) and its integration into the E. coli chromosome, with integrase (pLDR8) and IHF. Labels include gal, bio, and the direction of replication fork movement.]

Figure 3
SEQUENCE LISTING



<110> ELITRA PHARMACEUTICALS, INC. 
Zyskind, Judith W. 

<120> CHITOBIASE AS A REPORTER ENZYME 

<130> ELITRA.006VPC

<150> 60/159,221 
<151> 1999-10-13 

<160> 19 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 1 

gcacatgcat gctggtcatt aaattttcc 29 

<210> 2
<211> 26
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 2

cggggtacca actcatcctg caatcg 26

<210> 3
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 3

ctagtctaga tgccgaactc agaagtga 28

<210> 4
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 4

gcacatgcat gcgggggatg gcttgtagat 30

<210> 5
<211> 25
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 5 

catgatcatg cgacaggttt gatga 25 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 6 

gggggcgcct acctttcacg ag 22 

<210> 7
<211> 26
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 7

catgcatgca tgaaacgatg gacacc 26

<210> 8
<211> 26
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 8

cggggtacca actcatcctg caatcg 26

<210> 9
<211> 12
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 9

cattaatgca tg 12

<210> 10
<211> 12
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide
<400> 10

cattaatgca tg 12

<210> 11 
<211> 5826 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pDYK9 
<400> 11 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480 

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440 

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740 

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gctggtcatt aaattttcca 1800 

atatgcggcg taaatcgtgc ccgcctcgcg gcaggatcgt ttacacttag cgagttctgg 1860 

aaagtcctgt ggataaatcg ggaaaatctg tgagaaacag aagatctctt gcgcagttta 1920 

ggctatgatc cgcggtcccg atcgttttgc aggatcttga tcgggcatat aaccgcagac 1980 

agcggttcgt gcgtcaccct caagcagggt cttttcgacg tacgtcaaca atcatgaatg 2040 

tttcagcctt agtcattatc gacttttgtt cgagtggagt ccgccgtgtc actttcgctt 2100 

tggcagcagt gtcttgcccg attgcaggat gagttggtac cgagctctga gcaacaagtt 2160 

gtaaactcac tggctgataa ccttgatatc caatatgaag tgttaactaa ccatggtgct 2220 

aacgaaggtc ttgcgtgcca agatatgggc gcagaatggg cttcttgtaa caaagtaaac 2280 

atgacgcttg ttaaccaagg tgaagctgtt gactcaaaag attgggctat ttacttccac 2340 

agcattcgtc tgattctgga tgttgacaac gagcagttca aaatctctcg tgtaacgggt 2400

gacctacata agctagaacc aacagataag tttgacggct tcgctgccgg tgaagaggtt 2460 

gttcttccat tggttggtga atactggcaa ctatttgaaa ctgacttcat gccgggtgca 2520 

ttcgtttctg ctccaaacgc agaacctaag atgattgctt ctctaaatac tgaagatgtt 2580 

gcgtcttttg tgacgggtct tgaaggtaac aacctaaaac gtacaccaga tgacaacaat 2640 

gtatttgcaa acgctgtgtc tcgttttgag aaaaacgaag acctagcaac acaagacgta 2700 

tcaaccacgt tactaccaac accaatgcac gttgaagcgg gtaaaggcaa agtagatatc 2760

gcggatggta ttgcgctgcc taaagacgca ttcgatgcga ctcagttcgc agcgattcaa 2820 

gatcgtgcag aagtggtagg tgtggacgtt cgtggtgatc ttcctgtaag catcactgtt 2880 
gttcctgcag acttcaccgg tgaattagca aaatctggtg cttacgaaat gagcatcaaa 2940

ggcgacggta ttgtgattaa agcgttcgac caagcaggcg ctttctacgc agtacaatct 3000 

atctttggcc tggtagatag ccaaaatgct gattctctac cacaactgtc tattaaagat 3060 

gcgcctcgtt ttgattaccg tggtgtgatg gtggatgtgg ctcgtaactt ccactctaag 3120 

gacgcaatcc ttgcaacgct agaccaaatg gcagcgtaca agatgaacaa acttcacctt 3180

cacctaaccg atgatgaagg ctggcgttta gaaatcccgg gtctgcctga gctgacagaa 3240 

gtgggtgcta accgttgttt cgatacacaa gagaaaagct gtttactgcc tcagcttggc 3300 

tctggtccaa cgacagacaa ctttggctct ggctacttca gcaaagcaga ctacgtggaa 3360 

atcttgaaat acgcgaaagc acgtaacatt gaagtgattc cagaaatcga tatgccagct 3420 

cacgctcgtg cagcagtagt atcaatggaa gctcgttacg accgcctaat ggaagaaggt 3480

aaagaagctg aagcgaacga ataccgtctg atggatcctc aagatacatc aaacgtaacg 3540 

acggttcagt tctacaataa gcaaagcttc atcaacccat gtatggaatc ttcaactcgc 3600 

tttgttgata aggtgatttc agaagtggca gcaatgcacc aagaagctgg cgctccacta 3660 

acaacttggc acttcggtgg tgacgaagcg aagaacatca agctaggtgc tggtttccaa 3720 

gacgttaacg cagaagataa agtaagctgg aaaggcacga ttgacctgtc taaacaagac 3780 

aagccgtttg cacagtctcc acaatgtcag acgctaatca cagatggcac agtcagtgac 3840 

tttgctcacc taccaagcca cttcgcggaa gaagtgtcga agattgttgc tgagaaaggc 3900 

attccaaact tccaagcttg gcaagatggt ttgaaataca gtgacggcga aaaagcgttc 3960 

gctacagaaa atactcgcgt aaacttctgg gacgttctgt actggggcgg tacttcctca 4020 

gtgtacgagt ggtctaagaa aggttacgac gtgattgttt ctaacccaga ttacgtgtac 4080

atggatatgc catacgaagt tgacccgaaa gagcgtggtt actactgggc aacacgtgca 4140

acggatactc gtaagatgtt tggctttgca ccagagaaca tgcctcaaaa cgcagaaact 4200 

tctgtagatc gcgatggcaa tggctttact ggtaaaggtg aaatcgaagc gaaacctttc 4260 

tacggtctat ctgcacaact ttggtctgag acagtacgta acgacgagca atacgagtac 4320 

atggtattcc ctcgcgtcct cgctgctgct cagcgtgcat ggcaccgtgc tgactgggaa 4380

aacgactaca aagttggtgt tgagtactcg caaaactcta atctagttga taaagcatcg 4440 

ctaaaccaag actacaaccg ctttgcgaac gtacttggtc aacgtgaact ggctaagcta 4500 

gaaaaatcag gtattgacta ccgcctacca gtaccaggtg caaaagtaga agatggtaag 4560 

ctagcaatga acgttcagtt ccctggcgta acgcttcaat actctctgga tggtgagaac 4620

tggttgactt atgcagacaa cgctcgtcca aatgtaactg gtgaagtctt catccgctcg 4680 

gtatctgcga caggtgagaa ggtaagccgt atcactagcg tgaaataata gcgctcagta 4740 

ttcactaaaa tcatagttcc ttactcaaag ccctcaactt atgttggggg ctttgtttat 4800 

ttttcttcgg aaaataagcg tgatcatgcg acaggtttga tgacaaaaaa ttagcgcaag 4860 

aagacaaaaa tcaccttgcg ctaatgctct gttacaggtc actaatacca tctaagtagt 4920 

tgattcatag tgactgcata tgttgtgttt tacagtatta tgtagtctgt tttttatgca 4980 

aaatctaatt taatatattg atatttatat cattttacgt ttctcgttca gcttttttat 5040 

actaagttgg cattataaaa aagcattgcn tatcaatttg ttgcaacgaa caggtcacta 5100 

tcagtcaaaa taaaatcatt atttgatttc aattttgtcc cactccctgc ctctgtcatc 5160 

acgatactgt gatgccatgg tgtccgactt atgcccgaga agatgttgag caaacttatc 5220 

gcttatctgc ttctcataga gtcttgcaga caaactgcgc aactcgtgaa aggtaggcgc 5280 

ccccggtcga atttgctttc gaatttctgc cattcatccg cttattatca cttattcagg 5340 

cgtagcacca ggcgtttaag ggcaccaata actgccttaa aaaaattacg ccccgccctg 5400 

ccactcatcg cagtactgtt gtaattcatt aagcattctg ccgacatgga agccatcaca 5460 

gacggcatga tgaacctgaa tcgccagcgg catcagcacc ttgtcgcctt gcgtataata 5520 

tttgcccatg gtgaaaacgg gggcgaagaa gttgtccata ttggccacgt ttaaatcaaa 5580 

actggtgaaa ctcacccagg gattggctga gacgaaaaac atattctcaa taaacccttt 5640 

agggaaatag gccaggtttt caccgtaaca cgccacatct tgcgaatata tgtgtagaaa 5700 

ctgccggaaa tcgtcgtggt attcactcca gagcgatgaa aacgtttcag tttgctcatg 5760 

gaaaacggtg taacaagggt gaacactatc ccatatcacc agctcaccgt ctttcattgc 5820 

catacg 5826 

<210> 12 
<211> 6071 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pDYK11
<400> 12 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 
gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120
ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 
tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 
aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 
ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 
ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 
ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480 
gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540
acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 
cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 
agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 
tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 
cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840
cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 
caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440 

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740 

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gcatgaaacg atggacaccg 1800 

aagccatggg tgattaaaga ggccggattg taataattgt acactccgga gtcaattctc 1860

tttccttatt taccgcgctt ttccgcacct tttcgcaggg aaaatgtacg acctcacacc 1920 

agtggaaacc agcatggcgc gccgggtgga ggattatacg ggctgatggg taaagcgcaa 1980 

ggatcgtcct ggatctttat tagatcgatt aagccaattt ttgtctatgg tcattaaatt 2040 

ttccaatatg cggcgtaaat cgtgcccgcc tcgcggcagg atcgtttaca cttagcgagt 2100 

tctggaaagt cctgtggata aatcgggaaa atctgtgaga aacagaagat ctcttgcgca 2160 

gtttaggcta tgatccgcgg tcccgatcgt tttgcaggat cttgatcggg catataaccg 2220 

cagacagcgg ttcgtgcgtc accctcaagc agggtctttt cgacgtacgt caacaatcat 2280 

gaatgtttca gccttagtca ttatcgactt ttgttcgagt ggagtccgcc gtgtcacttt 2340 

cgctttggca gcagtgtctt gcccgattgc aggatgagtt ggtaccgagc tctgagcaac 2400 

aagttgtaaa ctcactggct gataaccttg atatccaata tgaagtgtta actaaccatg 2460 

gtgctaacga aggtcttgcg tgccaagata tgggcgcaga atgggcttct tgtaacaaag 2520 

taaacatgac gcttgttaac caaggtgaag ctgttgactc aaaagattgg gctatttact 2580 

tccacagcat tcgtctgatt ctggatgttg acaacgagca gttcaaaatc tctcgtgtaa 2640 

cgggtgacct acataagcta gaaccaacag ataagtttga cggcttcgct gccggtgaag 2700 

aggttgttct tccattggtt ggtgaatact ggcaactatt tgaaactgac ttcatgccgg 2760 

gtgcattcgt ttctgctcca aacgcagaac ctaagatgat tgcttctcta aatactgaag 2820 

atgttgcgtc ttttgtgacg ggtcttgaag gtaacaacct aaaacgtaca ccagatgaca 2880 

acaatgtatt tgcaaacgct gtgtctcgtt ttgagaaaaa cgaagaccta gcaacacaag 2940 

acgtatcaac cacgttacta ccaacaccaa tgcacgttga agcgggtaaa ggcaaagtag 3000 

atatcgcgga tggtattgcg ctgcctaaag acgcattcga tgcgactcag ttcgcagcga 3060 

ttcaagatcg tgcagaagtg gtaggtgtgg acgttcgtgg tgatcttcct gtaagcatca 3120 

ctgttgttcc tgcagacttc accggtgaat tagcaaaatc tggtgcttac gaaatgagca 3180

tcaaaggcga cggtattgtg attaaagcgt tcgaccaagc aggcgctttc tacgcagtac 3240 

aatctatctt tggcctggta gatagccaaa atgctgattc tctaccacaa ctgtctatta 3300 

aagatgcgcc tcgttttgat taccgtggtg tgatggtgga tgtggctcgt aacttccact 3360 

ctaaggacgc aatccttgca acgctagacc aaatggcagc gtacaagatg aacaaacttc 3420 

accttcacct aaccgatgat gaaggctggc gtttagaaat cccgggtctg cctgagctga 3480 

cagaagtggg tgctaaccgt tgtttcgata cacaagagaa aagctgttta ctgcctcagc 3540 

ttggctctgg tccaacgaca gacaactttg gctctggcta cttcagcaaa gcagactacg 3600 

tggaaatctt gaaatacgcg aaagcacgta acattgaagt gattccagaa atcgatatgc 3660 

cagctcacgc tcgtgcagca gtagtatcaa tggaagctcg ttacgaccgc ctaatggaag 3720 



WO 01/27322 PCT/US00/21048



aaggtaaaga agctgaagcg aacgaatacc gtctgatgga tcctcaagat acatcaaacg 3780 

taacgacggt tcagttctac aataagcaaa gcttcatcaa cccatgtatg gaatcttcaa 3840 

ctcgctttgt tgataaggtg atttcagaag tggcagcaat gcaccaagaa gctggcgctc 3900 

cactaacaac ttggcacttc ggtggtgacg aagcgaagaa catcaagcta ggtgctggtt 3960 

tccaagacgt taacgcagaa gataaagtaa gctggaaagg cacgattgac ctgtctaaac 4020 

aagacaagcc gtttgcacag tctccacaat gtcagacgct aatcacagat ggcacagtca 4080 

gtgactttgc tcacctacca agccacttcg cggaagaagt gtcgaagatt gttgctgaga 4140 

aaggcattcc aaacttccaa gcttggcaag atggtttgaa atacagtgac ggcgaaaaag 4200 

cgttcgctac agaaaatact cgcgtaaact tctgggacgt tctgtactgg ggcggtactt 4260 

cctcagtgta cgagtggtct aagaaaggtt acgacgtgat tgtttctaac ccagattacg 4320 

tgtacatgga tatgccatac gaagttgacc cgaaagagcg tggttactac tgggcaacac 4380

gtgcaacgga tactcgtaag atgtttggct ttgcaccaga gaacatgcct caaaacgcag 4440

aaacttctgt agatcgcgat ggcaatggct ttactggtaa aggtgaaatc gaagcgaaac 4500 

ctttctacgg tctatctgca caactttggt ctgagacagt acgtaacgac gagcaatacg 4560 

agtacatggt attccctcgc gtcctcgctg ctgctcagcg tgcatggcac cgtgctgact 4620

gggaaaacga ctacaaagtt ggtgttgagt actcgcaaaa ctctaatcta gttgataaag 4680 

catcgctaaa ccaagactac aaccgctttg cgaacgtact tggtcaacgt gaactggcta 4740

agctagaaaa atcaggtatt gactaccgcc taccagtacc aggtgcaaaa gtagaagatg 4800 

gtaagctagc aatgaacgtt cagttccctg gcgtaacgct tcaatactct ctggatggtg 4860

agaactggtt gacttatgca gacaacgctc gtccaaatgt aactggtgaa gtcttcatcc 4920

gctcggtatc tgcgacaggt gagaaggtaa gccgtatcac tagcgtgaaa taatagcgct 4980

cagtattcac taaaatcata gttccttact caaagccctc aacttatgtt gggggctttg 5040 

tttatttttc ttcggaaaat aagcgtgatc atgcgacagg tttgatgaca aaaaattagc 5100 

gcaagaagac aaaaatcacc ttgcgctaat gctctgttac aggtcactaa taccatctaa 5160 

gtagttgatt catagtgact gcatatgttg tgttttacag tattatgtag tctgtttttt 5220 

atgcaaaatc taatttaata tattgatatt tatatcattt tacgtttctc gttcagcttt 5280 

tttatactaa gttggcatta taaaaaagca ttgcttatca atttgttgca acgaacaggt 5340 

cactatcagt caaaataaaa tcattatttg atttcaattt tgtcccactc cctgcctctg 5400 

tcatcacgat actgtgatgc catggtgtcc gacttatgcc cgagaagatg ttgagcaaac 5460 

ttatcgctta tctgcttctc atagagtctt gcagacaaac tgcgcaactc gtgaaaggta 5520 

ggcgcccccg gtcgaatttg ctttcgaatt tctgccattc atccgcttat tatcacttat 5580 

tcaggcgtag caccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg 5640 

ccctgccact catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca 5700 

tcacagacgg catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta 5760 

taatatttgc ccatggtgaa aacgggggcg aagaagttgt ccatattggc cacgtttaaa 5820 

tcaaaactgg tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac 5880 

cctttaggga aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt 5940 

agaaactgcc ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc 6000 

tcatggaaaa cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc 6060 

attgccatac g 6071 
<210> 13 
<211> 5641 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pJMF3 
<400> 13 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 
tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840 

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740 

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gcattaatgt gagttagctc 1800 

actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt 1860 

gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc caagcttgca 1920 

tgcctgcagg tcgactctag aggatccccg ggtaccgagc tctgagcaac aagttgtaaa 1980 

ctcactggct gataaccttg atatccaata tgaagtgtta actaaccatg gtgctaacga 2040 

aggtcttgcg tgccaagata tgggcgcaga atgggcttct tgtaacaaag taaacatgac 2100 

gcttgttaac caaggtgaag ctgttgactc aaaagattgg gctatttact tccacagcat 2160 

tcgtctgatt ctggatgttg acaacgagca gttcaaaatc tctcgtgtaa cgggtgacct 2220 

acataagcta gaaccaacag ataagtttga cggcttcgct gccggtgaag aggttgttct 2280 

tccattggtt ggtgaatact ggcaactatt tgaaactgac ttcatgccgg gtgcattcgt 2340 

ttctgctcca aacgcagaac ctaagatgat tgcttctcta aatactgaag atgttgcgtc 2400 

ttttgtgacg ggtcttgaag gtaacaacct aaaacgtaca ccagatgaca acaatgtatt 2460

tgcaaacgct gtgtctcgtt ttgagaaaaa cgaagaccta gcaacacaag acgtatcaac 2520 

cacgttacta ccaacaccaa tgcacgttga agcgggtaaa ggcaaagtag atatcgcgga 2580 

tggtattgcg ctgcctaaag acgcattcga tgcgactcag ttcgcagcga ttcaagatcg 2640 

tgcagaagtg gtaggtgtgg acgttcgtgg tgatcttcct gtaagcatca ctgttgttcc 2700 

tgcagacttc accggtgaat tagcaaaatc tggtgcttac gaaatgagca tcaaaggcga 2760 

cggtattgtg attaaagcgt tcgaccaagc aggcgctttc tacgcagtac aatctatctt 2820 

tggcctggta gatagccaaa atgctgattc tctaccacaa ctgtctatta aagatgcgcc 2880 

tcgttttgat taccgtggtg tgatggtgga tgtggctcgt aacttccact ctaaggacgc 2940

aatccttgca acgctagacc aaatggcagc gtacaagatg aacaaacttc accttcacct 3000 

aaccgatgat gaaggctggc gtttagaaat cccgggtctg cctgagctga cagaagtggg 3060 

tgctaaccgt tgtttcgata cacaagagaa aagctgttta ctgcctcagc ttggctctgg 3120 

tccaacgaca gacaactttg gctctggcta cttcagcaaa gcagactacg tggaaatctt 3180 

gaaatacgcg aaagcacgta acattgaagt gattccagaa atcgatatgc cagctcacgc 3240 

tcgtgcagca gtagtatcaa tggaagctcg ttacgaccgc ctaatggaag aaggtaaaga 3300 

agctgaagcg aacgaatacc gtctgatgga tcctcaagat acatcaaacg taacgacggt 3360 

tcagttctac aataagcaaa gcttcatcaa cccatgtatg gaatcttcaa ctcgctttgt 3420 

tgataaggtg atttcagaag tggcagcaat gcaccaagaa gctggcgctc cactaacaac 3480 

ttggcacttc ggtggtgacg aagcgaagaa catcaagcta ggtgctggtt tccaagacgt 3540

taacgcagaa gataaagtaa gctggaaagg cacgattgac ctgtctaaac aagacaagcc 3600 

gtttgcacag tctccacaat gtcagacgct aatcacagat ggcacagtca gtgactttgc 3660

tcacctacca agccacttcg cggaagaagt gtcgaagatt gttgctgaga aaggcattcc 3720 

aaacttccaa gcttggcaag atggtttgaa atacagtgac ggcgaaaaag cgttcgctac 3780 

agaaaatact cgcgtaaact tctgggacgt tctgtactgg ggcggtactt cctcagtgta 3840 

cgagtggtct aagaaaggtt acgacgtgat tgtttctaac ccagattacg tgtacatgga 3900 

tatgccatac gaagttgacc cgaaagagcg tggttactac tgggcaacac gtgcaacgga 3960 

tactcgtaag atgtttggct ttgcaccaga gaacatgcct caaaacgcag aaacttctgt 4020 

agatcgcgat ggcaatggct ttactggtaa aggtgaaatc gaagcgaaac ctttctacgg 4080 

tctatctgca caactttggt ctgagacagt acgtaacgac gagcaatacg agtacatggt 4140 

attccctcgc gtcctcgctg ctgctcagcg tgcatggcac cgtgctgact gggaaaacga 4200 

ctacaaagtt ggtgttgagt actcgcaaaa ctctaatcta gttgataaag catcgctaaa 4260 

ccaagactac aaccgctttg cgaacgtact tggtcaacgt gaactggcta agctagaaaa 4320 

atcaggtatt gactaccgcc taccagtacc aggtgcaaaa gtagaagatg gtaagctagc 4380 
aatgaacgtt cagttccctg gcgtaacgct tcaatactct ctggatggtg agaactggtt 4440

gacttatgca gacaacgctc gtccaaatgt aactggtgaa gtcttcatcc gctcggtatc 4500 

tgcgacaggt gagaaggtaa gccgtatcac tagcgtgaaa taatagcgct cagtattcac 4560 

taaaatcata gttccttact caaagccctc aacttatgtt gggggctttg tttatttttc 4620

ttcggaaaat aagcgtgatc atgcgacagg tttgatgaca aaaaattagc gcaagaagac 4680 

aaaaatcacc ttgcgctaat gctctgttac aggtcactaa taccatctaa gtagttgatt 4740 

catagtgact gcatatgttg tgttttacag tattatgtag tctgtttttt atgcaaaatc 4800 

taatttaata tattgatatt tatatcattt tacgtttctc gttcagcttt tttatactaa 4860 

gttggcatta taaaaaagca ttgcttatca atttgttgca acgaacaggt cactatcagt 4920

caaaataaaa tcattatttg atttcaattt tgtcccactc cctgcctctg tcatcacgat 4980

actgtgatgc catggtgtcc gacttatgcc cgagaagatg ttgagcaaac ttatcgctta 5040 

tctgcttctc atagagtctt gcagacaaac tgcgcaactc gtgaaaggta ggcgcccccg 5100 

gtcgaatttg ctttcgaatt tctgccattc atccgcttat tatcacttat tcaggcgtag 5160 

caccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg ccctgccact 5220 

catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacagacgg 5280 

catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc 5340 

ccatggtgaa aacgggggcg aagaagttgt ccatattggc cacgtttaaa tcaaaactgg 5400 

tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac cctttaggga 5460

aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt agaaactgcc 5520 

ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa 5580 

cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc attgccatac 5640 

g 5641

<210> 14 
<211> 5670 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pJMF4 
<400> 14 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480 

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840 

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gcattaatgt gagttagctc 1800

actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt 1860 

gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc caagcttgca 1920 

tgcctgcagg tcgactctag aggatccccg ggtaccgagc tctgagcaac aagttgtaaa 1980 

ctcactggct gataaccttg atatccaata tgaagtgtta actaaccatg gtgctaacga 2040

aggtcttgcg tgccaagata tgggcgcaga atgggcttct tgtaacaaag taaacatgac 2100 

gcttgttaac caaggtgaag ctgttgactc aaaagattgg gctatttact tccacagcat 2160 

tcgtctgatt ctggatgttg acaacgagca gttcaaaatc tctcgtgtaa cgggtgacct 2220 

acataagcta gaaccaacag ataagtttga cggcttcgct gccggtgaag aggttgttct 2280 

tccattggtt ggtgaatact ggcaactatt tgaaactgac ttcatgccgg gtgcattcgt 2340 

ttctgctcca aacgcagaac ctaagatgat tgcttctcta aatactgaag atgttgcgtc 2400 

ttttgtgacg ggtcttgaag gtaacaacct aaaacgtaca ccagatgaca acaatgtatt 2460

tgcaaacgct gtgtctcgtt ttgagaaaaa cgaagaccta gcaacacaag acgtatcaac 2520 

cacgttacta ccaacaccaa tgcacgttga agcgggtaaa ggcaaagtag atatcgcgga 2580 

tggtattgcg ctgcctaaag acgcattcga tgcgactcag ttcgcagcga ttcaagatcg 2640 

tgcagaagtg gtaggtgtgg acgttcgtgg tgatcttcct gtaagcatca ctgttgttcc 2700 

tgcagacttc accggtgaat tagcaaaatc tggtgcttac gaaatgagca tcaaaggcga 2760

cggtattgtg attaaagcgt tcgaccaagc aggcgctttc tacgcagtac aatctatctt 2820 

tggcctggta gatagccaaa atgctgattc tctaccacaa ctgtctatta aagatgcgcc 2880 

tcgttttgat taccgtggtg tgatggtgga tgtggctcgt aacttccact ctaaggacgc 2940 

aatccttgca acgctagacc aaatggcagc gtacaagatg aacaaacttc accttcacct 3000 

aaccgatgat gaaggctggc gtttagaaat cccgggtctg cctgagctga cagaagtggg 3060 

tgctaaccgt tgtttcgata cacaagagaa aagctgttta ctgcctcagc ttggctctgg 3120 

tccaacgaca gacaactttg gctctggcta cttcagcaaa gcagactacg tggaaatctt 3180 

gaaatacgcg aaagcacgta acattgaagt gattccagaa atcgatatgc cagctcacgc 3240 

tcgtgcagca gtagtatcaa tggaagctcg ttacgaccgc ctaatggaag aaggtaaaga 3300 

agctgaagcg aacgaatacc gtctgatgga tcctcaagat acatcaaacg taacgacggt 3360 

tcagttctac aataagcaaa gcttcatcaa cccatgtatg gaatcttcaa ctcgctttgt 3420 

tgataaggtg atttcagaag tggcagcaat gcaccaagaa gctggcgctc cactaacaac 3480 

ttggcacttc ggtggtgacg aagcgaagaa catcaagcta ggtgctggtt tccaagacgt 3540 

taacgcagaa gataaagtaa gctggaaagg cacgattgac ctgtctaaac aagacaagcc 3600 

gtttgcacag tctccacaat gtcagacgct aatcacagat ggcacagtca gtgactttgc 3660 

tcacctacca agccacttcg cggaagaagt gtcgaagatt gttgctgaga aaggcattcc 3720 

aaacttccaa gcttggcaag atggtttgaa atacagtgac ggcgaaaaag cgttcgctac 3780 

agaaaatact cgcgtaaact tctgggacgt tctgtactgg ggcggtactt cctcagtgta 3840 

cgagtggtct aagaaaggtt acgacgtgat tgtttctaac ccagattacg tgtacatgga 3900 

tatgccatac gaagttgacc cgaaagagcg tggttactac tgggcaacac gtgcaacgga 3960 

tactcgtaag atgtttggct ttgcaccaga gaacatgcct caaaacgcag aaacttctgt 4020 

agatcgcgat ggcaatggct ttactggtaa aggtgaaatc gaagcgaaac ctttctacgg 4080 

tctatctgca caactttggt ctgagacagt acgtaacgac gagcaatacg agtacatggt 4140 

attccctcgc gtcctcgctg ctgctcagcg tgcatggcac cgtgctgact gggaaaacga 4200 

ctacaaagtt ggtgttgagt actcgcaaaa ctctaatcta gttgataaag catcgctaaa 4260 

ccaagactac aaccgctttg cgaacgtact tggtcaacgt gaactggcta agctagaaaa 4320 

atcaggtatt gactaccgcc taccagtacc aggtgcaaaa gtagaagatg gtaagctagc 4380 

aatgaacgtt cagttccctg gcgtaacgct tcaatactct ctggatggtg agaactggtt 4440 

gacttatgca gacaacgctc gtccaaatgt aactggtgaa gtcttcatcc gctcggtatc 4500 

tgcgacaggt gagaaggtaa gccgtatcac tagcgtgaaa taatagcgct cagtattcac 4560 

taaaatcata gttccttact caaagccctc aacttatgtt gggggctttg tttatttttc 4620 

ttcggaaaat aagcgtgatc ccccgggggc gcctaccttt cacgagttgc gcagtttgtc 4680 

tgcaagactc tatgagaagc agataagcga taagtttgct caacatcttc tcgggcataa 4740 

gtcggacacc atggcatcac agtatcgtga tgacagaggc agggagtggg acaaaattga 4800 

aatcaaataa tgattttatt ttgactgata gtgacctgtt cgttgcaaca aattgataag 4860 

caatgctttt ttataatgcc aacttagtat aaaaaagctg aacgagaaac gtaaaatgat 4920 

ataaatatca atatattaaa ttagattttg cataaaaaac agactacata atactgtaaa 4980 

acacaacata tgcagtcact atgaatcaac tacttagatg gtattagtga cctgtaacag 5040 

agcattagcg caaggtgatt tttgtcttct tgcgctaatt ttttgtcatc aaacctgtcg 5100 

catgatcatg gggctgcagg aattcgatgg tcgaatttgc tttcgaattt ctgccattca 5160 

tccgcttatt atcacttatt caggcgtagc accaggcgtt taagggcacc aataactgcc 5220 

ttaaaaaaat tacgccccgc cctgccactc atcgcagtac tgttgtaatt cattaagcat 5280 

tctgccgaca tggaagccat cacagacggc atgatgaacc tgaatcgcca gcggcatcag 5340 

caccttgtcg ccttgcgtat aatatttgcc catggtgaaa acgggggcga agaagttgtc 5400

catattggcc acgtttaaat caaaactggt gaaactcacc cagggattgg ctgagacgaa 5460

aaacatattc tcaataaacc ctttagggaa ataggccagg ttttcaccgt aacacgccac 5520 

atcttgcgaa tatatgtgta gaaactgccg gaaatcgtcg tggtattcac tccagagcga 5580 

tgaaaacgtt tcagtttgct catggaaaac ggtgtaacaa gggtgaacac tatcccatat 5640 

caccagctca ccgtctttca ttgccatacg 5670 

<210> 15 
<211> 238 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> LacZα/chitobiase Fusion

<220>
<221> CDS
<222> (119)...(240)

<221> -35_signal
<222> (44)...(50)
<223> Lac promoter

<221> -10_signal
<222> (69)...(74)
<223> Lac promoter

<221> protein_bind
<222> (81)...(107)
<223> Lac repressor binding site

<221> protein_bind
<222> (7)...(34)
<223> CAP-cAMP binding site

<400> 15 

gcatgcatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt 60 
ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct 118 
atg acc atg att acg cca agc ttg cat gcc tgc agg tcg act cta gag 166
Met Thr Met Ile Thr Pro Ser Leu His Ala Cys Arg Ser Thr Leu Glu
1 5 10 15

gat ccc cgg gta ccg agc tct gag caa caa gtt gta aac tca ctg gct 214
Asp Pro Arg Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala
20 25 30

gat aac ctt gat atc caa tat gaa 238
Asp Asn Leu Asp Ile Gln Tyr Glu
35 40



<210> 16 
<211> 40 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LacZα/chitobiase Fusion
<400> 16 

Met Thr Met Ile Thr Pro Ser Leu His Ala Cys Arg Ser Thr Leu Glu
1 5 10 15
Asp Pro Arg Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala
20 25 30
Asp Asn Leu Asp Ile Gln Tyr Glu
35 40

<210> 17 
<211> 22 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> dnaA/chitobiase Fusion 
<400> 17 

Met Ser Leu Ser Leu Trp Gln Gln Cys Leu Ala Arg Leu Gln Asp Glu
1 5 10 15
Leu Val Pro Ser Ser Glu
20

<210> 18 
<211> 2643 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(2643)

<223> dnaA/Chitobiase Fusion 

<400> 18 

gtg tca ctt tcg ctt tgg cag cag tgt ctt gcc cga ttg cag gat gag 48
Met Ser Leu Ser Leu Trp Gln Gln Cys Leu Ala Arg Leu Gln Asp Glu
1 5 10 15

ttg gta ccg agc tct gag caa caa gtt gta aac tca ctg gct gat aac 96
Leu Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala Asp Asn
20 25 30

ctt gat atc caa tat gaa gtg tta act aac cat ggt gct aac gaa ggt 144
Leu Asp Ile Gln Tyr Glu Val Leu Thr Asn His Gly Ala Asn Glu Gly
35 40 45

ctt gcg tgc caa gat atg ggc gca gaa tgg gct tct tgt aac aaa gta 192
Leu Ala Cys Gln Asp Met Gly Ala Glu Trp Ala Ser Cys Asn Lys Val
50 55 60

aac atg acg ctt gtt aac caa ggt gaa gct gtt gac tca aaa gat tgg 240
Asn Met Thr Leu Val Asn Gln Gly Glu Ala Val Asp Ser Lys Asp Trp
65 70 75 80

gct att tac ttc cac agc att cgt ctg att ctg gat gtt gac aac gag 288
Ala Ile Tyr Phe His Ser Ile Arg Leu Ile Leu Asp Val Asp Asn Glu
85 90 95

cag ttc aaa atc tct cgt gta acg ggt gac cta cat aag cta gaa cca 336
Gln Phe Lys Ile Ser Arg Val Thr Gly Asp Leu His Lys Leu Glu Pro
100 105 110

aca gat aag ttt gac ggc ttc gct gcc ggt gaa gag gtt gtt ctt cca 384
Thr Asp Lys Phe Asp Gly Phe Ala Ala Gly Glu Glu Val Val Leu Pro
115 120 125

ttg gtt ggt gaa tac tgg caa cta ttt gaa act gac ttc atg ccg ggt 432
Leu Val Gly Glu Tyr Trp Gln Leu Phe Glu Thr Asp Phe Met Pro Gly
130 135 140

gca ttc gtt tct gct cca aac gca gaa cct aag atg att gct tct cta 480
Ala Phe Val Ser Ala Pro Asn Ala Glu Pro Lys Met Ile Ala Ser Leu
145 150 155 160

aat act gaa gat gtt gcg tct ttt gtg acg ggt ctt gaa ggt aac aac 528
Asn Thr Glu Asp Val Ala Ser Phe Val Thr Gly Leu Glu Gly Asn Asn
165 170 175

cta aaa cgt aca cca gat gac aac aat gta ttt gca aac gct gtg tct 576
Leu Lys Arg Thr Pro Asp Asp Asn Asn Val Phe Ala Asn Ala Val Ser
180 185 190

cgt ttt gag aaa aac gaa gac cta gca aca caa gac gta tca acc acg 624
Arg Phe Glu Lys Asn Glu Asp Leu Ala Thr Gln Asp Val Ser Thr Thr
195 200 205

tta cta cca aca cca atg cac gtt gaa gcg ggt aaa ggc aaa gta gat 672
Leu Leu Pro Thr Pro Met His Val Glu Ala Gly Lys Gly Lys Val Asp
210 215 220

atc gcg gat ggt att gcg ctg cct aaa gac gca ttc gat gcg act cag 720
Ile Ala Asp Gly Ile Ala Leu Pro Lys Asp Ala Phe Asp Ala Thr Gln
225 230 235 240

ttc gca gcg att caa gat cgt gca gaa gtg gta ggt gtg gac gtt cgt 768
Phe Ala Ala Ile Gln Asp Arg Ala Glu Val Val Gly Val Asp Val Arg
245 250 255

ggt gat ctt cct gta agc atc act gtt gtt cct gca gac ttc acc ggt 816
Gly Asp Leu Pro Val Ser Ile Thr Val Val Pro Ala Asp Phe Thr Gly
260 265 270

gaa tta gca aaa tct ggt gct tac gaa atg agc atc aaa ggc gac ggt 864
Glu Leu Ala Lys Ser Gly Ala Tyr Glu Met Ser Ile Lys Gly Asp Gly
275 280 285

att gtg att aaa gcg ttc gac caa gca ggc gct ttc tac gca gta caa 912
Ile Val Ile Lys Ala Phe Asp Gln Ala Gly Ala Phe Tyr Ala Val Gln
290 295 300

tct atc ttt ggc ctg gta gat agc caa aat gct gat tct cta cca caa 960
Ser Ile Phe Gly Leu Val Asp Ser Gln Asn Ala Asp Ser Leu Pro Gln
305 310 315 320

ctg tct att aaa gat gcg cct cgt ttt gat tac cgt ggt gtg atg gtg 1008
Leu Ser Ile Lys Asp Ala Pro Arg Phe Asp Tyr Arg Gly Val Met Val
325 330 335

gat gtg gct cgt aac ttc cac tct aag gac gca atc ctt gca acg cta 1056
Asp Val Ala Arg Asn Phe His Ser Lys Asp Ala Ile Leu Ala Thr Leu
340 345 350

gac caa atg gca gcg tac aag atg aac aaa ctt cac ctt cac cta acc 1104
Asp Gln Met Ala Ala Tyr Lys Met Asn Lys Leu His Leu His Leu Thr
355 360 365

gat gat gaa ggc tgg cgt tta gaa atc ccg ggt ctg cct gag ctg aca 1152
Asp Asp Glu Gly Trp Arg Leu Glu Ile Pro Gly Leu Pro Glu Leu Thr
370 375 380

gaa gtg ggt gct aac cgt tgt ttc gat aca caa gag aaa agc tgt tta 1200
Glu Val Gly Ala Asn Arg Cys Phe Asp Thr Gln Glu Lys Ser Cys Leu
385 390 395 400

ctg cct cag ctt ggc tct ggt cca acg aca gac aac ttt ggc tct ggc 1248
Leu Pro Gln Leu Gly Ser Gly Pro Thr Thr Asp Asn Phe Gly Ser Gly
405 410 415

tac ttc agc aaa gca gac tac gtg gaa atc ttg aaa tac gcg aaa gca 1296
Tyr Phe Ser Lys Ala Asp Tyr Val Glu Ile Leu Lys Tyr Ala Lys Ala
420 425 430

cgt aac att gaa gtg att cca gaa atc gat atg cca gct cac gct cgt 1344
Arg Asn Ile Glu Val Ile Pro Glu Ile Asp Met Pro Ala His Ala Arg
435 440 445

gca gca gta gta tca atg gaa gct cgt tac gac cgc cta atg gaa gaa 1392
Ala Ala Val Val Ser Met Glu Ala Arg Tyr Asp Arg Leu Met Glu Glu
450 455 460

ggt aaa gaa gct gaa gcg aac gaa tac cgt ctg atg gat cct caa gat 1440
Gly Lys Glu Ala Glu Ala Asn Glu Tyr Arg Leu Met Asp Pro Gln Asp
465 470 475 480

aca tca aac gta acg acg gtt cag ttc tac aat aag caa agc ttc atc 1488
Thr Ser Asn Val Thr Thr Val Gln Phe Tyr Asn Lys Gln Ser Phe Ile
485 490 495

aac cca tgt atg gaa tct tca act cgc ttt gtt gat aag gtg att tca 1536
Asn Pro Cys Met Glu Ser Ser Thr Arg Phe Val Asp Lys Val Ile Ser
500 505 510

gaa gtg gca gca atg cac caa gaa gct ggc gct cca cta aca act tgg 1584
Glu Val Ala Ala Met His Gln Glu Ala Gly Ala Pro Leu Thr Thr Trp
515 520 525

cac ttc ggt ggt gac gaa gcg aag aac atc aag cta ggt gct ggt ttc 1632
His Phe Gly Gly Asp Glu Ala Lys Asn Ile Lys Leu Gly Ala Gly Phe
530 535 540

caa gac gtt aac gca gaa gat aaa gta agc tgg aaa ggc acg att gac 1680
Gln Asp Val Asn Ala Glu Asp Lys Val Ser Trp Lys Gly Thr Ile Asp
545 550 555 560

ctg tct aaa caa gac aag ccg ttt gca cag tct cca caa tgt cag acg 1728
Leu Ser Lys Gln Asp Lys Pro Phe Ala Gln Ser Pro Gln Cys Gln Thr
565 570 575

cta atc aca gat ggc aca gtc agt gac ttt gct cac cta cca agc cac 1776
Leu Ile Thr Asp Gly Thr Val Ser Asp Phe Ala His Leu Pro Ser His
580 585 590

ttc gcg gaa gaa gtg tcg aag att gtt gct gag aaa ggc att cca aac 1824
Phe Ala Glu Glu Val Ser Lys Ile Val Ala Glu Lys Gly Ile Pro Asn
595 600 605

ttc caa gct tgg caa gat ggt ttg aaa tac agt gac ggc gaa aaa gcg 1872
Phe Gln Ala Trp Gln Asp Gly Leu Lys Tyr Ser Asp Gly Glu Lys Ala
610 615 620

ttc gct aca gaa aat act cgc gta aac ttc tgg gac gtt ctg tac tgg 1920
Phe Ala Thr Glu Asn Thr Arg Val Asn Phe Trp Asp Val Leu Tyr Trp
625 630 635 640

ggc ggt act tcc tca gtg tac gag tgg tct aag aaa ggt tac gac gtg 1968
Gly Gly Thr Ser Ser Val Tyr Glu Trp Ser Lys Lys Gly Tyr Asp Val
645 650 655

att gtt tct aac cca gat tac gtg tac atg gat atg cca tac gaa gtt 2016
Ile Val Ser Asn Pro Asp Tyr Val Tyr Met Asp Met Pro Tyr Glu Val
660 665 670

gac ccg aaa gag cgt ggt tac tac tgg gca aca cgt gca acg gat act 2064
Asp Pro Lys Glu Arg Gly Tyr Tyr Trp Ala Thr Arg Ala Thr Asp Thr
675 680 685

cgt aag atg ttt ggc ttt gca cca gag aac atg cct caa aac gca gaa 2112
Arg Lys Met Phe Gly Phe Ala Pro Glu Asn Met Pro Gln Asn Ala Glu
690 695 700

act tct gta gat cgc gat ggc aat ggc ttt act ggt aaa ggt gaa atc 2160
Thr Ser Val Asp Arg Asp Gly Asn Gly Phe Thr Gly Lys Gly Glu Ile
705 710 715 720

gaa gcg aaa cct ttc tac ggt cta tct gca caa ctt tgg tct gag aca 2208
Glu Ala Lys Pro Phe Tyr Gly Leu Ser Ala Gln Leu Trp Ser Glu Thr
725 730 735

gta cgt aac gac gag caa tac gag tac atg gta ttc cct cgc gtc ctc 2256
Val Arg Asn Asp Glu Gln Tyr Glu Tyr Met Val Phe Pro Arg Val Leu
740 745 750

gct gct gct cag cgt gca tgg cac cgt gct gac tgg gaa aac gac tac 2304
Ala Ala Ala Gln Arg Ala Trp His Arg Ala Asp Trp Glu Asn Asp Tyr
755 760 765

aaa gtt ggt gtt gag tac tcg caa aac tct aat cta gtt gat aaa gca 2352
Lys Val Gly Val Glu Tyr Ser Gln Asn Ser Asn Leu Val Asp Lys Ala
770 775 780

tcg cta aac caa gac tac aac cgc ttt gcg aac gta ctt ggt caa cgt 2400
Ser Leu Asn Gln Asp Tyr Asn Arg Phe Ala Asn Val Leu Gly Gln Arg
785 790 795 800

gaa ctg gct aag cta gaa aaa tca ggt att gac tac cgc cta cca gta 2448
Glu Leu Ala Lys Leu Glu Lys Ser Gly Ile Asp Tyr Arg Leu Pro Val
805 810 815

cca ggt gca aaa gta gaa gat ggt aag cta gca atg aac gtt cag ttc 2496
Pro Gly Ala Lys Val Glu Asp Gly Lys Leu Ala Met Asn Val Gln Phe
820 825 830

cct ggc gta acg ctt caa tac tct ctg gat ggt gag aac tgg ttg act 2544
Pro Gly Val Thr Leu Gln Tyr Ser Leu Asp Gly Glu Asn Trp Leu Thr
835 840 845

tat gca gac aac gct cgt cca aat gta act ggt gaa gtc ttc atc cgc 2592
Tyr Ala Asp Asn Ala Arg Pro Asn Val Thr Gly Glu Val Phe Ile Arg
850 855 860

tcg gta tct gcg aca ggt gag aag gta agc cgt atc act agc gtg aaa 2640
Ser Val Ser Ala Thr Gly Glu Lys Val Ser Arg Ile Thr Ser Val Lys
865 870 875 880

taa 2643
*

<210> 19 
<211> 880 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> dnaA/Chitobiase Fusion 
<400> 19 

Met Ser Leu Ser Leu Trp Gln Gln Cys Leu Ala Arg Leu Gln Asp Glu
1 5 10 15

Leu Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala Asp Asn
20 25 30

Leu Asp Ile Gln Tyr Glu Val Leu Thr Asn His Gly Ala Asn Glu Gly
35 40 45

Leu Ala Cys Gln Asp Met Gly Ala Glu Trp Ala Ser Cys Asn Lys Val
50 55 60

Asn Met Thr Leu Val Asn Gln Gly Glu Ala Val Asp Ser Lys Asp Trp
65 70 75 80

Ala Ile Tyr Phe His Ser Ile Arg Leu Ile Leu Asp Val Asp Asn Glu
85 90 95

Gln Phe Lys Ile Ser Arg Val Thr Gly Asp Leu His Lys Leu Glu Pro
100 105 110

Thr Asp Lys Phe Asp Gly Phe Ala Ala Gly Glu Glu Val Val Leu Pro
115 120 125

Leu Val Gly Glu Tyr Trp Gln Leu Phe Glu Thr Asp Phe Met Pro Gly
130 135 140

Ala Phe Val Ser Ala Pro Asn Ala Glu Pro Lys Met Ile Ala Ser Leu
145 150 155 160

Asn Thr Glu Asp Val Ala Ser Phe Val Thr Gly Leu Glu Gly Asn Asn
165 170 175

Leu Lys Arg Thr Pro Asp Asp Asn Asn Val Phe Ala Asn Ala Val Ser
180 185 190

Arg Phe Glu Lys Asn Glu Asp Leu Ala Thr Gln Asp Val Ser Thr Thr
195 200 205

Leu Leu Pro Thr Pro Met His Val Glu Ala Gly Lys Gly Lys Val Asp
210 215 220

Ile Ala Asp Gly Ile Ala Leu Pro Lys Asp Ala Phe Asp Ala Thr Gln
225 230 235 240

Phe Ala Ala Ile Gln Asp Arg Ala Glu Val Val Gly Val Asp Val Arg
245 250 255

Gly Asp Leu Pro Val Ser Ile Thr Val Val Pro Ala Asp Phe Thr Gly
260 265 270

Glu Leu Ala Lys Ser Gly Ala Tyr Glu Met Ser Ile Lys Gly Asp Gly
275 280 285

Ile Val Ile Lys Ala Phe Asp Gln Ala Gly Ala Phe Tyr Ala Val Gln
290 295 300

Ser Ile Phe Gly Leu Val Asp Ser Gln Asn Ala Asp Ser Leu Pro Gln
305 310 315 320

Leu Ser Ile Lys Asp Ala Pro Arg Phe Asp Tyr Arg Gly Val Met Val
325 330 335

Asp Val Ala Arg Asn Phe His Ser Lys Asp Ala Ile Leu Ala Thr Leu
340 345 350

Asp Gln Met Ala Ala Tyr Lys Met Asn Lys Leu His Leu His Leu Thr
355 360 365

Asp Asp Glu Gly Trp Arg Leu Glu Ile Pro Gly Leu Pro Glu Leu Thr
370 375 380

Glu Val Gly Ala Asn Arg Cys Phe Asp Thr Gln Glu Lys Ser Cys Leu
385 390 395 400

Leu Pro Gln Leu Gly Ser Gly Pro Thr Thr Asp Asn Phe Gly Ser Gly
405 410 415

Tyr Phe Ser Lys Ala Asp Tyr Val Glu Ile Leu Lys Tyr Ala Lys Ala
420 425 430

Arg Asn Ile Glu Val Ile Pro Glu Ile Asp Met Pro Ala His Ala Arg
435 440 445

Ala Ala Val Val Ser Met Glu Ala Arg Tyr Asp Arg Leu Met Glu Glu
450 455 460

Gly Lys Glu Ala Glu Ala Asn Glu Tyr Arg Leu Met Asp Pro Gln Asp
465 470 475 480

Thr Ser Asn Val Thr Thr Val Gln Phe Tyr Asn Lys Gln Ser Phe Ile
485 490 495

Asn Pro Cys Met Glu Ser Ser Thr Arg Phe Val Asp Lys Val Ile Ser
500 505 510

Glu Val Ala Ala Met His Gln Glu Ala Gly Ala Pro Leu Thr Thr Trp
515 520 525

His Phe Gly Gly Asp Glu Ala Lys Asn Ile Lys Leu Gly Ala Gly Phe
530 535 540

Gln Asp Val Asn Ala Glu Asp Lys Val Ser Trp Lys Gly Thr Ile Asp
545 550 555 560

Leu Ser Lys Gln Asp Lys Pro Phe Ala Gln Ser Pro Gln Cys Gln Thr
565 570 575

Leu Ile Thr Asp Gly Thr Val Ser Asp Phe Ala His Leu Pro Ser His
580 585 590

Phe Ala Glu Glu Val Ser Lys Ile Val Ala Glu Lys Gly Ile Pro Asn
595 600 605

Phe Gln Ala Trp Gln Asp Gly Leu Lys Tyr Ser Asp Gly Glu Lys Ala
610 615 620

Phe Ala Thr Glu Asn Thr Arg Val Asn Phe Trp Asp Val Leu Tyr Trp
625 630 635 640

Gly Gly Thr Ser Ser Val Tyr Glu Trp Ser Lys Lys Gly Tyr Asp Val
645 650 655

Ile Val Ser Asn Pro Asp Tyr Val Tyr Met Asp Met Pro Tyr Glu Val
660 665 670

Asp Pro Lys Glu Arg Gly Tyr Tyr Trp Ala Thr Arg Ala Thr Asp Thr
675 680 685

Arg Lys Met Phe Gly Phe Ala Pro Glu Asn Met Pro Gln Asn Ala Glu
690 695 700

Thr Ser Val Asp Arg Asp Gly Asn Gly Phe Thr Gly Lys Gly Glu Ile
705 710 715 720

Glu Ala Lys Pro Phe Tyr Gly Leu Ser Ala Gln Leu Trp Ser Glu Thr
725 730 735

Val Arg Asn Asp Glu Gln Tyr Glu Tyr Met Val Phe Pro Arg Val Leu
740 745 750

Ala Ala Ala Gln Arg Ala Trp His Arg Ala Asp Trp Glu Asn Asp Tyr
755 760 765

Lys Val Gly Val Glu Tyr Ser Gln Asn Ser Asn Leu Val Asp Lys Ala
770 775 780

Ser Leu Asn Gln Asp Tyr Asn Arg Phe Ala Asn Val Leu Gly Gln Arg
785 790 795 800

Glu Leu Ala Lys Leu Glu Lys Ser Gly Ile Asp Tyr Arg Leu Pro Val
805 810 815

Pro Gly Ala Lys Val Glu Asp Gly Lys Leu Ala Met Asn Val Gln Phe
820 825 830

Pro Gly Val Thr Leu Gln Tyr Ser Leu Asp Gly Glu Asn Trp Leu Thr
835 840 845

Tyr Ala Asp Asn Ala Arg Pro Asn Val Thr Gly Glu Val Phe Ile Arg
850 855 860

Ser Val Ser Ala Thr Gly Glu Lys Val Ser Arg Ile Thr Ser Val Lys
865 870 875 880